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Preface 


The  present  volume  is  Final  Report  supposed  to  be  submitted  by  schedule  of  works  in  accordance 
with  the  contract  No.  F61775  98-WE116.  The  Report  describes  all  accomplishments,  results  and 
conclusions  of  the  research  for  contract.  In  particular,  it  contains  results  of  research  on  the  following  tasks: 

•  Mathematical  analysis  of  existing  diagnostic  procedure  utilizing  statistical  clustering  and  Bayes’ 
techniques  and  further  development  of  mathematical  basis  and  algorithm  for  diagnostic  model 
design. 

As  a  whole,  these  results  formed  a  new  technology  of  assessment  and  prognosis  of  the  probability  of 
failure  of  the  hardware  like  avionics; 

•  Development  of  software  package  implementing  all  steps  of  cluster  analysis  of  statistical  data  and 
further  phases  of  the  developed  technology. 

This  software  made  it  possible  to  verify  and  validate  the  developed  technology  numerically.  A 
peculiarity  of  the  developed  software  is  (1)  the  interactivity,  (2)  the  utilization  a  computer  graphics  to 
facilitate  previewing  of  the  clustering  pattern  in  particular  subspaces  and  (3)  the  use  of  the  modem 
software  engineering  environment  like  Visual  €++  5.0  and  MS  Access-97; 

•  Application  of  Algebraic  Bayes’  Network  approach  to  various  classes  of  applications  of  experts’ 
analysis  techniques  to  diagnostic  related  problem; 

•  Application  of  classical  regression  analysis  to  estimation  of  remaining  life  expectancy  (residual 
performance  resource)  of  hardware  subjected  to  adverse  effects. 

Advantage  of  the  developed  regression  model  is  that  it  is  based  on  Dynamic  Data  Model  (DDM)  what 
has  made  it  possible  to  design  the  regression  model  utilizing  ideas  of  prognosis  of  time  series. 

All  results  provided  by  contract  are  presented  in  the  sufficient  details  in  the  Final  Report. 
Nevertheless,  the  Interim  Report  ITR-98]  has  to  be  considered  as  the  indefeasible  part  of  the  former  report 
because  some  aspects  of  the  developed  knowledge  engineering  technology  for  diagnostic  related  problem 
solving  are  considered  in  the  latter  in  more  details.  In  addition.  Interim  Report  presented  more  numerical 
results  which  were  not  repeated  in  Final  Report. 

Final  Report  consists  of  sk  sections. 

The  Section  1  is  introductory.  It  outlines  the  application  itself  and  related  prognostic  tasks,  main 
focus  of  the  research  and  contents  of  the  Report. 

Section  2  presents  Dynamic  Data  Model  that  was  developed  as  alternative  to  Static  Data  Model 
which  was  used  for  numerical  validation  of  the  developed  prognostics  model  in  the  first  phase  of  research 
resulting  in  Interim  Report  [IR-98]. 

Section  3  presents  developed  variants  of  the  regression  models  and  their  comparative  numerical 
analysis. 

Section  4  describes  main  results  related  to  the  task  of  knowledge  discovery  fi-om  statistical 
database  to  design  information-based  health  assessment  system.  This  question  is  a  focus  of  the 
research. 

Section  5  is  devoted  to  the  application  of  Algebraic  Bayes’  Network  approach  to  the  diagnostic 
related  knowledge  engineering  tasks.  Theoretical  part  of  the  section  coincides  mainly  with  the  respective 
material  given  in  the  Interim  Report  [IR-98].  In  contrast,  it  contains  much  more  numerical  results  which 
were  calculated  on  the  basis  of  the  developed  software. 

Section  6  has  to  be  considered  as  the  general  conclusion  of  the  research.  It  outlines  the  main  results, 
research  contribution  and  summarizes  the  most  perspective  future  research  in  the  fi-amework  of 
prognostics  and  related  topics  of  Knowledge  Discovery  from  Data  (KDD)  technology. 

This  contents  reflects  the  tasks  formulated  within  contract. 

Contractor 

Chief  Scientist  of  the  St.  Petersburg  Institute  for  Informatics 

and  Automation  of  the  Russian  Academy  of  Sciences 
Ph.D.  Prof. 


Vladimir  Gorodetski. 
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1.  Introduction 


Safe,  reliable  and  efficient  operation  of  avionics  is  crucial  for  a  modem  aircraft  or  spacecraft.  >^^le 
in  operation,  avionics  components  are  exposed  to  electrical  perturbations,  mechanical  vibrations, 
excessive  temperatures,  humidity,  etc.  These  adverse  conditions,  acting  individually  and  in  combination, 
are  known  to  have  cumulative  effects  leading  to  avionics  performance  degradation  md  failmes.  Until 
recently,  it  was  virtually  impossible  to  obtain  data  characterizing  performance  of  individual  units.  At  the 
present,  availability  of  dedicated  monitoring  systems  and  like  devices  allows  for  the  collection  of  large 
amounts  of  actual  data  of  any  particular  unit  of  aircraft  hardware.  Based  on  this  data,  modem  Data  Mining 
techniques,  common  in  technology  of  Knowledge  Discovery  from  Data  (KDD),  made  it  possible  to 
facilitate  formulation  and  solution  of  important  on-line  and  off-line  prognostic-relat^  problems. 

These  new  possibilities  for  hardware  monitoring,  for  on-line  and  off-line  prognostic  related 
problem  solving  predetermined  the  tasks  that  are  the  subject  of  the  contract.  According  to  the  contract  the 
research  presented  in  this  Report  aimed  at  the  development  of  mathematical  models,  algorithms  and 
software  for  solving  the  following  tasks; 

•  accurate  assessment  of  the  probability  of  failure  of  hardware,  such  as  avionics,  on  the  basis  of  its 
known  history  of  abuse  by  environmental  and  operational  factors; 

•  prognosis  of  the  probability  of  failure  of  hardware  at  a  given  time  in  the  future,  for  example,  at  the 
end  of  the  forthcoming  sortie  of  the  aircraft; 

•  accurate  assessment  of  the  residual  performance  resource  of  hardware  on  the  basis  of  regression 
model  and  its  known  history  of  abuse  by  environmental  and  operational  factors  and  known 
cumulative  time  of  maintenance  (number  of  sortie). 

These  task  statement  was  prompted  by  the  modem  concept  of  maintenance  known  as  the 
"service  when  needed"  [Skormin  et  al-97].  Let  us  consider  the  peculiarity  of  the  above  task  statement 
compared  to  the  traditional  one. 

Traditionally,  reliability  of  any  technical  device  (electronic,  electro-mechanical,  and  mechanical)  is 
defined  in  terms  of  such  characteristics  as  the  average  time  of  normal  (no-failure)  operation.  These 
reliability  concepts  referring  to  a  statistically-generic  device  may  be  considered  acceptable  as  long  as  the 
failures  are  caused  by  the  factors  related  to  manufacturing.  At  the  present,  this  approach  is  not  always 
acceptable.  Manufacturers  of  electronics,  due  to  completely  automated  processes,  have  achieved  a  veiy 
high  degree  of  reliability  of  their  products  and  very  little  variation  in  properties  from  device  to  device. 
Manufacturing-related  effects  on  failures  of  electronics  are  gradually  becoming  less  significant.  The  main 
causes  of  failures  are  traced  now  to  the  individual  operational  and  environmental  conditions  of  particular 
units.  Therefore,  the  average  time  of  normal  operation  and  other  "traditional"  reliability  characteristics, 
defined  without  taking  into  account  actual  “history  of  abuse”  of  a  device,  are  becoming  less  important. 

Classical  reliability  had  a  good  reason  for  addressing  a  statistically-generic  device.  At  that  time  it 
was  virtually  impossible  to  obtain  data  characterizing  performance  of  individual  units  in  various  operating 
environments.  At  the  present,  availability  of  Time-Stress  Measurement  Devices  (TSMD)  [Popyack-98], 
smart  sensors  and  data  acquisition  systems  makes  possible  to  collect  large  amount  of  actual  data  of  any 
particular  unit  of  aircraft  hardware.  Based  on  this  data,  modem  Data  Mining  techniques,  common  in 
Knowledge  Discovery  from  Data  (KDD)  technology,  facilitate  formulation  and  solution  of  important  on¬ 
line  and  off-line  reliaWlity-related  problems.  The  most  important  problem  is  forecasting  the  probability  of 
failure  of  flight-critical  units  of  aircraft  hardware  during  a  forAcoming  sortie.  Solving  such  problem 
implies  the  investigation  of  the  role  of  various  environmental  factors  in  the  development  of  particular 
failures,  investigation  of  combined  effects  of  several  factors,  reevaluation  of  probability  of  failure  on  the 
basis  of  known  exposure  to  particular  adverse  conditions,  as  well  as  development  of  special  types  of 
mathematical  models  and  model-based  techniques. 

Data  Mining  and  KDD  address  specific  practical  needs  for  solving  above-mentioned 
problems.  Data  Mining  provides  a  wide  spectmm  of  available  techniques  and  tools  to  develop  a  KDD 
technology  focusing  on  design  of  a  mathematical  model  for  particular  application  ([Frawley  et  al  -91], 
[Matheus  et  al  93],  [Fayyad  et  al-95-1],  [Fayyad  et  al  95-2],  [Bradley  et  al-98-1]).  It  is  well  known  that 
every  particular  application  possesses  specific  properties  that  require  either  the  ability  to  adapt  already 
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existing  Data  Mining  techniques  or  develop  new  ones  to  build  an  adequate  and  efficient  technology  of 
original  data  processing  aimed  at  a  particular  model  development. 

As  per  common  understanding  [Fayyad  et  al-95-1],  a  KDD  process  considered  herein  consists 
of  a  number  of  Data  Mining  procedures  that,  regardless  of  domain  and  particular  task,  conceptually, 
include  such  steps  as  (1)  definition  of  the  goal  of  the  task,  (2)  collection  or  model-based  generation  of 
adequate  statistical  data  and  its  preprocessing,  (3)  data  reduction,  transformation  to  find  useful  data 
patterns  and  its  specifications  and  representations,  including  visualization  if  possible,  (4)  development 
of  a  KDD  strategy  that,  actually,  corresponds  to  the  outline  of  the  future  technology  as  a  number  of 
steps  of  Data  Mining,  (5)  selection,  adaptation  or  development  of  Data  Mining  methods  and 
algorithms  intended  for  the  realization  of  the  accepted  KDD  technology  (search  of  informative  subsets 
of  attributes  and  pattern  of  interest,  separation  and  decision  making  rules  creation,  features,  regression 
model  development,  etc.),  (6)  interpretation  of  the  Data  Mining  results  and  incorporation  of  these 
results  into  a  target  model,  (7)  testing  and  validation  of  the  resultant  model. 

Steps  of  this  KDD  process  are  usually  iterative  and  interactive  and  are  common  for  any  KDD 
process.  Nevertheless,  from  the  algorithmic  and  implementation  points  of  view,  particular  KDD 
processes  may  be  implemented  in  very  different  ways.  It  is  well  known  that  the  best  universal 
approach  does  not  exist.  Moreover,  the  wider  the  area  of  possible  utilization  of  an  algorithm  or 
approach,  the  lesser  its  efficiency.  Therefore,  taking  into  account  the  domain  and  task  specifics, 
combined  with  the  experience  in  KDD  technology  and  Data  Mining,  assures  the  successful  solution  of 
any  particular  application  problem.  Then,  following  such  a  principle  in  the  framework  of  tasks 
predetermined  by  contract,  we  developed  an  approach  that  consists  of  traditional  steps  of  KDD 
process  but  its  application  reflects  the  following  framework: 

•  peculiarities  of  the  goal  of  the  task  (prognosis  of  probability  of  failure  of  avionics); 

•  original  statistical  data  available  for  diagnostic  and  prognostic  model  design  (for  example,  TSMD- 
based  records  of  cumulative  exposure  to  environmental  factors  and  operational  conditions); 

•  the  need  for  a  highly  dependable  model-based  prognostic  procedure; 

•  requirement  of  a  reliable  assessment  of  probabilities  involved  in  the  calculation  of  the  probability 
of  failure  of  a  hardware  even  if  the  size  of  statistical  data  is  small; 

The  Report  is  organized  as  follows. 

In  the  first  phase  of  research  reflected  in  the  Interim  Report  [IR-98]  we  considered  «histoiy  of 
abuse»  specified  by  the  vector  of  adverse  exposures  of  a  unit  operation  and  environmental  conditions. 
This  data  model  may  be  called  “Static  Data  Model”  (SDM),  because  it  doesn’t  take  into  account  the 
history  of  failure  development.  It  was  reasonable  to  use  such  simplified  data  model  to  focus  research 
on  the  mathematical  aspects  regarding  the  task  of  prognostics. 

Unfortunately,  SDM  is  not  appropriate  for  development  of  the  precise  regression  model  aimed  at 
residual  performance  resource  forecasting.  It  will  be  justified  below  that  to  solve  the  last  task  we  need 
a  model  that  reflects  the  history  of  failure  development  for  a  particular  device,  i.e.  we  need  a  model 
that  makes  it  possible  to  specify  file  «trajectory»  of  failure  development  for  a  particular  unit.  It  is 
reasonable  to  call  the  model  that  makes  it  possible  to  obtain  trajectory  of  failure  development  of  a 
particular  unit  as  “Dynamic  Data  Model”  (DDM).  In  the  next  section  (Section  2)  we  present  the 
developed  Dynamic  Data  Model. 

Section  3  is  devoted  to  the  presentation  of  the  developed  variants  of  the  regression  models  for  the 
forecasting  of  the  residual  performance  resource  of  the  hardware  and  its  comparative  numerical  analysis. 
It  was  obtained  numerically  that  traditional  regression  model  that  doesn’t  takes  into  account  the  history  of 
a  failure  development  of  the  particular  device  doesn’t  possess  the  required  precision  of  residual 
performance  resource  forecasting.  Instead,  the  regression  model  designed  on  the  basis  of  DDM  model  of 
failure  development  seems  to  be  much  more  advantageous.  The  corresponding  regression  model  was 
developed  and  is  described  in  Section  3.  Additionally,  this  section  contains  results  of  numerical 
investigation  of  the  regression  procedure  parameters  that  are  sensitive  regarding  to  the  precision  of  the 
assessed  residual  performance  resource. 

In  Section  4,  to  assess  probability  of  failure  of  avionics,  the  developed  technology  of  the  model- 
based  prognosis  system  is  presented.  Actually,  these  results  were  presented  in  detail  in  the  Interim 
Report  [IR-98].  Nevertheless,  they  outlined  here  in  brief  and  are  extended  by  some  new  numerical 
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results  obtained  due  to  newly  developed  software.  This  section  contains  a  brief  description  of  the 
heuristic  informativity  criteria  that  are  used  in  a  general  case  for  the  preliminary  selection  of 
informative  subspaces  of  low  dimension.  These  procedures  are  demonstrated  numerically  for  two- 

dimensional  case.  . 

A  notion  of  a  classification  predicate  is  defined  and  a  number  of  approaches  to  obtam  such 
predicates  are  proposed.  We  use  a  visualization  technique  that  makes  it  possible  for  a  developer  to 
draw  a  separation  bound  of  any  arbitrary  form  approximated  by  linear  spline  and  to  generate  the 
associated  classification  predicate  automatically.  Then  we  describe  the  main  principle  behind  the 
design  of  decision  trees  and  associated  probabilistic  spaces  that  form  a  set  of  decision  procedures. 
Since  the  major  purpose  of  the  model  under  development  in  this  section  is  the  assessment  of  the 
probability  of  failure  of  a  hardware  unit,  in  Section  4  we  consider  the  way  of  improvement  of  the 
precision  and  reliability  of  this  assessment  using  the  small  size  of  experimental  data  and  experts’ 
knowledge.  We  present  the  numerical  results  as  an  example  of  an  implementation  of  the  outlined 
technology  for  the  development  of  a  model-based  prognostic  procedure  for  a  particular  avionics 
module. 

Section  5  is  devoted  to  the  application  of  Algebraic  Bayes’  Network  approach  to  the  dia^ostic 
related  knowledge  engineering  tasks.  Theoretical  part  of  the  section  coincides  in  main  aspects  with  the 
respective  material  given  in  the  Interim  Report  [IR-98].  In  contrast,  it  contains  much  more  numerical 

results  that  were  calculated  on  the  basis  of  additionally  developed  software. 

Section  <5  has  to  be  considered  as  the  general  conclusion  of  the  research.  It  outlines  the  main  results, 
research  contribution  and  suimnarizes  the  most  perspective  future  research  in  the  framework  of 
prognostics  and  related  topics  of  Knowledge  Discovery  from  Data  (KDD)  technology.  They  may  be 
considered  as  the  topics  of  the  proposals  for  eventual  future  research. 
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2.Dynamic  Data  Model  of  Failure  Development 


2.1.  Dynamic  Data  Model  vs.  Static  Data  Model 

According  to  the  problem  statement  [IR-98]  health  assessment  system  of  a  device  on  the  board 
of  an  aircraft  aims  at  solving  two  major  tasks: 

•  Evaluation  of  the  residual  performance  resource,  given  “history  of  abuse”,  and 

•  Prognosis  of  the  probability  of  failure  at  a  given  time  in  the  future,  for  example,  at  the  end 
of  the  forthcoming  sortie. 

In  the  first  phase  of  research  reflected  in  the  Interim  Report  [IR-98]  we  have  considered 
«history  of  abuse»  that  was  specified  by  the  vector  of  adverse  exposures  of  a  device  operation  and 
environmental  conditions.  This  data  model  may  be  called  as  “Static  Data  Model”  (SDM),  because  it 
doesn’t  take  into  account  the  history  of  failure  development  of  a  particular  device.  It  was  reasonable 
to  use  such  simplified  data  model  to  focus  research  on  the  mathematical  aspects  regarding  the  task  of 
prognostics. 

Unfortunately,  SDM  is  not  appropriate  for  development  of  the  precise  regression  model  of 
residual  performance  resource  forecasting.  It  will  be  justified  below  that  to  solve  the  last  task  we  need 
a  model  that  reflects  the  history  of  failure  development  for  a  particular  device,  i.e.  we  need  a  model 
that  makes  it  possible  to  specify  the  “trajectory”  of  failure  development  for  a  particular  device.  The 
latter  is  understood  as  ordered  sequence  of  pairs  )  >,  where  -  is  the  cumulative  time  of 

device  performance,  X(ti)  -  is  the  vector  of  adverse  exposures  at  the  time  .  It  is  reasonable  to  call 
the  model  which  enables  to  obtain  trajectory  of  failure  development  of  a  particular  device  as 
“Dynamic  Data  Model”  (DDM). 

On  the  other  hand,  regression  model  for  residual  performance  resource  forecasting  results  in  one 
more  sensitive  parameter  that  may  be  used  for  the  failure  prognostics.  Therefore,  utilization  of  DDM 
instead  of  SDM  makes  possible  to  extend  vector  of  adverse  exposures  by  one  more  component,  for 
example,  number  of  aircraft  sortie.  As  a  result  DDM  makes  it  possible  to  evaluate  in  more  realistic 
way  the  mathematical  basis  of  prognostic  task  solving  developed  on  the  first  phase  of  research  [[R-98] 
and  to  improve  it  if  necessary.  Below  in  this  section  we  describe  the  developed  DDM  for  information- 
based  health  assessment  system  which  is  intended  to  solve  both  tasks  mentioned  at  the  beginning  of 
this  section. 

2.2.  Properties  of  DDM  and  Assumptions 

We  aim  at  developing  DDM  that  is  provided  by  the  following  properties: 

•  correlation  of  components  of  the  vector  of  adverse  exposures  X(t)  at  a  time  t  of 
maintenance  of  a  device  is  known  and  specified  by  their  correlation  matrix  Cx{t,t) 
and  standard  deviations 

O’a'  (0 =[o’[^/(0].O'[^2(0]v..,o-[x„(0]f =[o-/(0»o‘2(0.-,«7-„(0f ;  these 

mathematical  entities  define  covariance  matrix  = 

=M[iX(ti)  — Xiti)){X(ti')  — Xitj))Y  and,  hence,  random  values  of  adverse 
exposures  accumulated  during  a  sortie  of  aircraft; 

•  mathematical  expectation  of  adverse  exposures  per  a  sortie  of  any  aircraft  is  constant  and 
denoted  by  M[  AX]  =  AX  * ; 


’  This  assumption  was  accepted  for  simplicity  of  data  model  implementation  and  doesn’t  influence  on  the 
generality  of  data  model  itself. 
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•  mathematical  expectations  of  accumulated  adverse  exposures  M[X{t)'\  =  X{t)  depends 
on  the  time  t  of  device  maintenance  reflecting  cumulative  character  of  adverse  factors; 

•  individual  biases  of  mathematical  expectations  of  adverse  exposures  SX(t,r)  are 
randomized  for  every  particular  device  number  r,  r=l,2,  ...,R  (this  property  aims  at 
taking  into  account  the  specific  of  manufacturing  of  a  device  and  its  maintenance 
conditions  on  the  board  of  the  particular  aircraft); 

•  mutual  correlation  of  the  vectors  of  adverse  exposures  assigned  to  the  different  values  of 
time  of  maintenance  is  specified  by  the  matrix  of  mutual  covariance 

M[(X(tj)  -  X if f)){X{tj)  -  Xitjfff  that  is  supposed  to  be  computed  numerically  from 
a  statistical  data  base ; 

•  Realization  of  the  random  event  Q&{«no  failure)),  «failure))}={0,l/  is  defined 
according  to  the  truth  values  of  logical  formulae  F  e  5  given  over  the  linear  terms 

»  where  component  x^,  i=l,  2, n  are  the  components 

of  the  vector  of  adverse  exposures,  a^^,a2s,—a„s  ~  valued  coefficients  and  s'  -  is 

the  index  of  the  vector  of  adverse  exposures. 

Additional  assumptions  utilized  within  the  developed  DDM  model  are  as  follows: 

•  correlation  matrix  (tj)  and  standard  deviations  (t)  = 

\c7j  it),  <72  it),...,  (7„  iOY  are  independent  on  the  number  of  aircraft  sortie,  i.e. 

and  therefore,  covariance  matrix  Wj^itj,tf)=Wj^  is  constant 

as  well; 

•  at  average,  each  sortie  of  particular  aircraft  has  2  hours  long;  this  assumption  makes  it 
possible  to  use  the  number  of  sortie  (denote  it  by  symbol  k)  as  an  equivalent  of  time  and 
to  deal  with  discrete  parameter  k  instead  of  continuous  one  t; 

•  number  of  device  r  may  be  identified  as  the  aircraft  number; 

•  distributions  of  random  values  elsewhere  below  are  normal  or  uniform. 

2.3.  Numerical  characteristics  of  DDM 

Thus,  DDM  of  failure  development  of  each  individual  device  (belonging  to  the  aircraft) 
number  r  as  it  was  introduced  in  the  previous  section  is  defined  by  the  following  data: 

•  covariance  matrix  that  can  be  calculated  in  standard  way  via  correlation  matrix  Cj^ 
and  vector  of  standard  deviation  <7j^ ; 

•  mathematical  expectations  of  adverse  exposures  M[Xit)\  =  X(/) ; 

•  individual  biases  of  mathematical  expectations  of  adverse  exposures  SXik,r) ; 

•  number  of  sortie  k  of  the  concrete  aircraft  r  and 

•  set  of  logic  formulae  e  5  that  determinate  the  conditions  corresponding  to  a 
realization  of  the  random  event  ‘failure  ”, 

Therefore,  DDM  may  be  used  for  generation  of  realizations  of  trajectories  Xik,r)  of  failure 
development  of  a  device  on  the  board  of  the  aircraft  number  r=l,  2,...,  R.  Below  the  information 
about  adverse  exposures  and  numerical  data  needed  to  generate  the  realizations  of  trajectories 
Xik,r)  are  given. 


*  We  consider  binary  status  of  device  performance  within  the  designed  DDM.  Notice,  that  in  the  classification 
problem  statement  (see  Section  4)  we  consider  one  more  value  of  device  status. 
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2.Dynainic  Data  Model  of  Failure  Development 
2.3.1.  Table  of  adverse  exposures  (database  composition) 


Vibration 

RMS,  l-2g, 

Vibration 

RMS,  3 -4  g 

imi 

Vibration 

RMS,  over  4g 

Humidity, 

20-50% 

Humidity, 

70-95% 

Environmental 
Temperature  15  -  0°C 

Power  Supply  1.1  - 
- 1.3  nominal  Vdc 

Environmental 
Temperature  0  -  15°C 

Power  Supply  over 
1.3  nominal  Vdc 

Environmental 
Temperature  50  -  75®C 

Functional  Over¬ 
load  5-10%^y 

Environmental 
Temperature  76  - 
lOO^C 

Fimctional  Over¬ 
load  11  -20%Xj 

Power  Supply 
.7  -  .9  nominal  Vdc 

Fimctional  Over¬ 
load  21 -30% 

One  more  column  (#21)  of  Database  composition  contains  the  value  of  device  status  and  is 
omitted  in  the  above  table  of  Database  composition. 


2.3.2.  Covariance  Matrix 


■  » 


3.16e-002 

•4.74e004 

-6.8e-003 

1.43e-003 

3.6-003 

1.53e-003 

3.63e-003 

1.G9e-004 

-2.75e-003 

•2.69e-003 

-2.246-004 

1.046-003 

3.86e-003 

4.656-003 

7.9e-003 

-6.64e-003 

-7.07e-003 

-7.176-005 

G.27e-004 


-4.74e-004 
2.376-003 
-3.2e-004 
8.086-004 
3.76e-004 
9.26-004 
6. 856-004 
1.96-005 
-1.71e-004 
-1 .856-003 
6.G4e-004 
6.27e-004 
1.836-003 
1.38e-003 
-2.76e-004 
-3.5Ge-004 
-7.7Ge-004 
-1.396-004 
2.076-004 


-6.8e-003 
-3.2e-004 
2.326-003 
5.136-004 
-2.246-004 
1.846-003 
-4.736-004 
-2.476-004 
2.376-004 
-8.696-004 
1.236-003 
3.44e-004 
-1.996-003 
-7.418-004 
-1.996-003 
1.546-003 
1.076-003 
-4.036-005 
2.176-005 


1.43e-003 

8.086-004 

5.136-004 

0.103 

-2.766-002 

6.896-003 

-2.696-003 

-1.896-003 

-1.G1e-003 

-8.086-004 

7.556-004 

6.31e-004 

2.786-003 

2.16-003 

1.16e-003 

-1.966-004 

-1.1e-003 

4.246-003 

-3.556-003 


3.6- 003 
3.76e-004 
-2.246-004 
-2.766-002 

9.596- 003 

5.596- 003 
5.45e-004 
-4.246-004 
-8.29e-004 
-1.296-003 
4.286-004 
G.34e-004 
3.186-004 

8.6- 004 
4.876-004 
-1.076-003 
-8.69e-004 
-9.166-004 
1.026-003 


1.536-003 

9.2e-004 

1.846-003 

6.89e-003 

5.596-003 

0.14 

-1.G9e-002 

-2.136-002 

-1.89e-002 

-1.04e-002 

1.776-002 

7.146-003 

2.86e-003 

9.346-004 

-4.3e-003 

-2.026-003 

-4.846-003 

9.746-003 

-7.69e-003 


3.63e-003 

6.856-004 

-4.736-004 

-2.696-003 

5.45e-004 

-1.696-002 

1.29e-002 

2.756-003 

1.186-003 

-1.236-002 

-2.696-003 

4.27e-003 

2.916-003 

1.8Ge-003 

1.23e-003 

-8.596-004 

-5.586-004 

-1.866-003 

1.536-003 


Covariance  Matrix  Wy  (continuation) 


mil 

101 

11 

mm 

13 

1 

1.696-004 

-2.756-003 

-2.69e-003 

-2.246-004 

1.046-003 

9.66e-003 

2 

1.96-005 

-1.716-004 

-1.856-003 

6.64e-004 

6.276-004 

1.896-003 

3 

-2.476-004 

2.376-004 

-8.696-004 

1.236-003 

3.446-004 

-1.996-003 

4 

-1.896-003 

-1.616-003 

-8.086-004 

7.556-004 

6.31 6-004 

2.786-003 

5 

-4.246-004 

-8.296-004 

-1.296-003 

4.286-004 

6.34e-004 

3.186-004 

6 

-2.136-002 

-1.896-002 

-1.046-002 

1.776-002 

7.146-003 

2.866-003 

7 

2.756-003 

1.18e-003 

-1.236-002 

-2.696-003 

4.276-003 

2.916-003 

8 

4.576-003 

3.216-003 

-2.5e-003 

-7.76-004 

5.446-004 

-3.396-003 

9 

3.216-003 

3.836-003 

-4.376-004 

-1.256-003 

8.096-005 

-7.036-003 

10 

-2.5e-003 

-4.376-004 

4.556-002 

-1.236-002 

-1.756-002 

2.616-003 

11 

-7.76-004 

-1.256-003 

-1.236-002 

8.896-002 

-2.896-003 

-1.326-002 

12 

5.446-004 

8.096-005 

-1.75e-002 

-2.896-003 

8.246-003 

-7.386-003 

13 

-3.396-003 

-7.036-003 

2.616-003 

-1.326-002 

.  -7.386-003 

0.187 

14 

-1.826-004 

-4.176-005 

-2.216-003 

5.296-004 

-8.196-004 

-2.016-002 

15 

2.56-003 

1.936-003 

-3.866-003 

7.736-003 

3.866-003 

-5.326-002 

16 

1.316-003 

2.736-003 

-3.3e-003 

9.426-003 

2.686-003 

-4.616-002 

17 

1.436-003 

2.386-003 

-8.6Ge-004 

7.6-004 

1.496-003 

-2.976-002 

18 

-1.876-003 

-1.786-003 

3.246-003 

-3.186-003 

-9.086-004 

2.666-003 

19 

1.566-003 

1.376-003 

-2.736-003 

2.656-003 

8.896-004 

-3.536-003 

2.Dynaiiiic  Data  Model  of  Failure  Development 


Covariance MatrixWjf  (continuation) 


1 

1^ 

1 

4.65e-003 

7.9e-003 

-6.646-003 

-7.076-003 

-7.176-005 

6.276-004 

2 

1.38e-003 

-2.76e-004 

-9.56e-004 

-7.76e-004 

-1.396-004 

2.076-004 

3 

-7.41e-004 

-1.996-003 

1.546-003 

1.076-003 

-4.036-005 

2.176-005 

4 

2.1e-003 

1.166-003 

-1.966-004 

-1.16-003 

4.246-003 

-3.556-003 

5 

8.e-004 

4.876-004 

-1.076-003 

-8.696-004 

-9.166-004 

1.026-003 

6 

9.34e-004 

-4.3e-003 

-2.026-003 

-4.846-003 

9.74e003 

-7.696-003 

7 

1.86e-003 

1.236-003 

-8.596-004 

-5.586-004 

-1 .866-003 

1.53e-003 

8 

-1.82e-004 

2.5e-003 

1.316-003 

1.436-003 

-1.876-003 

1.566-003 

9 

-4.17e005 

1.93e-003 

2.736-003 

2.386-003 

-1.786-003 

1.376-003 

10 

-2.21e-003 

-3.86e-003 

-3.36-003 

-8.66e-004 

3.24e-003 

-2.736-003 

11 

5.296-004 

7.736-003 

9.426-003 

7.6-004 

-3.186-003 

2.656-003 

12 

-8.19e-004 

3.86e-003 

2.68e-003 

1.496-003 

-9.086-004 

8.696-004 

13 

-2.016-002 

-5.32e-002 

-4.616-002 

-2.97e002 

2.666-003 

-3.536-003 

14 

6.35e-002 

-1.476-002 

-7.266-003 

-3.056-003 

-2.76-004 

4.176-004 

15 

-1 .476-002 

3.796-002 

1.356-002 

7.56e-003 

-1.216-003 

1.86-003 

16 

-7.266-003 

1.356-002 

1.896-002 

1.096-002 

-1.226-003 

1.046-003 

17 

-3.056-003 

7.56e-003 

1.096-002 

8.73e003 

-8.046-004 

5.236-004 

18 

-2.7e-004 

-1.216-003 

-1.226-003 

-8.046-004 

2.116-003 

-1.46e-003 

19 

4.176-004 

1.8e-003 

1.046-003 

5.23e-004 

-1.466-003 

1.386-003 

While  designing  the  correlation  matrix  we  took  into  account  the  actually  existing 
dependencies  between  adverse  factors.  These  dependencies  were  extracted  from  the  expert. 

The  correlation  of  adverse  exposures  Xj-  Xjg,  on  the  one  hand,  and  residual  performance 

resource  Xjo ,  on  the  other  hand,  have  to  be  computed  via  simulation. 

2.3.3.  Vector  of  standard  deviations  CT^ 

a(xj)=0.18,  a(x2)=0.049,  a(x3)=0.048,  o(xj=0.321,  a(x3)=0.097,  a(xj=0.04, 
o(Xj)=0.113,  a(Xs)=0.068,  o(Xg)=0.061,  o(xj=0.067, 

a(Xj,)=0.198  C7(x,2)=0.09,  a(x,3)=0.432,  o(xj=0.252,  a(xj3)=0.195,  a(xj=0.137. 
a(xj,)=0.934.  a(xj=0.046,  o(x,g)=0.037. 

According  to  the  well  known  algorithm  components  of  the  covariance  matrix  of  the  vector  X(k,r) 

was  calculated  as  follows: 

(^i  >^j)  ~  ^X  0’ ./)  ^  ’ 

where  Cj^  (/,  j)  -  are  the  elements  of  the  correlation  matrix  C ^  • 

2.3.4. Predicate  F:  description  of  the  status  ‘failure  ” 

Fj  =  {X,  >25),  F,  ={  x,2 ^7),  F,  ={  Xp  >28),  F,  ={  x^p  >27}. 

Fj  ={0.7xxy  ■\-0.2xx2  +0. 7XX3  ^0}={  Yj>30}, 

Fg  ={0.05xxg  +O.IXX2  +0.20xxg+0.6xXg+0.04xx3-0.01xx^>30}={Y2  >30), 

Fj  ={0.02xxio  +0.3xx2i  +0.68xxj2^0}={  Y^  >30  }, 
Fg={0.02xxj3+0.08xxj^+0.1xxj3+0.3xXjg+0.5xxi2>30}={Y4  >30), 

Ys={0.3xjg+0.7x,g  }, 

Yg  ={1.5xxg  -0.5  XX  4 }, 

FZl={0.2xYj  +O.25XY2  +0.3xY3+0.2xYg+0.05xYg>20}={Zj>20), 

FZ2={0.2xY2  +0.2Y3+0.4xY,  +0.1  xYg  +0.1xYg>15}={  Z2  >  15  }, 
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FZ3={0.  ISxYj  +0. 25  xY^  +ft  4xY,  +ft  IxY^  +0.  lxYg>16}={Z3>16  }, 

FF={0. 2xZj  +0,  4xZ2  +0. 4xZ,  >1 7}. 

The  logic  condition  of  the  event  “failure  ”  is  as  follows: 

F=  Fj  VF2  vFj  vF^  vFg  vFg  vFy  vFg  vFZlvFZ2vFZ3vFF  =  ”true”. 

Otherwise  the  status  of  the  device  is  “no  failure 

2.4.  Generation  of  trajectories  of  failure  development 

We  suppose  that  increment  of  adverse  exposures  per  sortie  consists  of  two  components.  One 
of  them  is  randomized  bias  SX(k,  r)=[dXj{k,r),dK2{k,r),...,dXjg(k,r)\  thathas  individual 
distribution  for  each  device  r  and  depends  on  the  number  of  sortie  k  as  well,  and  the  second  one  is  a 
random  value  AX(k)  which  distribution  is  independent  on  the  individual  properties  of  aircraft.  We 
suppose  that  both  of  them  have  equal  matrices  of  correlation  and  individual  values  of  standard 
deviations. 

Since  the  components  of  these  vectors  of  adverse  factors  are  correlated,  each  point  of 
trajectory  within  DDM  is  generated  in  a  number  of  steps.  They  are  as  follows: 

1.  Transformation  of  the  vector  of  adverse  factors  to  the  form  with  non-correlated  components. 

Since  we  supposed  in  DDM  that  components  of  the  random  vector  of  adverse  exposures  X  are 
correlated  we  have  to  use  special  algorithm  of  generation  of  its  realization  [see,  for  example, 
[Fukunaga-72]. 

Let  be  the  covariance  matrix  of  the  vector  AX(k)  of  increment  of  adverse  exposures  per 
sortie  number  ^  of  aircraft,  and  let  B,  A  be  matrices  of  eigenvectors  and  eigenvalues  of  the  covariance 
matrix  ■  Then 

W=B^  AB 

and  Y=B  ^  JSf  is  a  vector  which  components  are  non-correlated  and  distributed  normally  with 
» where  -  is  i-th  diagonal  element  of  the  matrix  A 
Thus,  in  the  first  step  of  the  algorithm  it  is  necessary  to  calculate  matrices  B  and  A  as  well  as 

2.  Generation  of  the  trajectories  of failure  development  in  terms  of  vector  Y. 

Forr=l,2,...,R 

For  i=l,2,...,19 

Ayi  (0,  r)  =  3(T(y, )  + 4  (4  "is  normal  random  value  having  Mf  4  J=0  and 

Syi(0,  r)=a  Ayi(0,r)  (a  -  is  uniform  random  value,  a  e[0,  aj.weuse  a  =0,1J). 
Xi(0,r)  =  0  endi. 

k=0. 

While  -\F  do  k=k+l 
For  i=l,2,...,19 

Syfk,r)  =  Sf0,r)  +  Ayf0,r)  x  p ,  where  P-'xs  uniform  random  value,  P  e  [0,0.1] ; 
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AyXk,r)  =  ?'(5cr,  +^)  (^  -  is  normal  random  value  having  A/[^  ]=0  and 
cr(^)  ) ,  where  -  is  uniform  random  value,  y  e  \P,1  -a  —  P\) 

Ay,  (k,  r)  =  (k,  r)  +  Ay,  (k,  r)  end  I 

S.Inverse  transformation  of  the  vector  AY  to  the  vector  AX: 

AX(k.r)=BAY(k,r) 

4.  Calculation  of  the  vector  of  cumulative  adverse  exposures 
X(k,  r)  =  X(k  -l,r)  +  AX(k,  r) 

it.  {Result:  trajectoiy  of  failure  development  for  given  device  (aircraft)  #  r) 

endr 

2.5.  Simulation  of  DDM 

The  above  DDM  model  implemented  within  Visual  C++  5.0  and  Access  97  Data  Base 
environment  was  used  to  generate  statistical  dynamic  data  supposed  to  be  used  for  numerical 
validation  of  the  developed  mathematical  model  and  algorithms  of  health  assessment  system  and 
regression  model.  In  the  fig.2.1  -  fig.2.4  the  trajectories  of  failure  development  for  selected  adverse 
exposures  for  25  samples  of  a  device  (aircraft)  are  given.  In  these  figures  horizontal  axis  corresponds 
to  the  number  of  sortie  and  vertical  one  corresponds  to  the  value  of  cumulative  exposure  of  respective 
adverse  factor.  Each  trajectory  consists  of  60-120  points.  Let  us  remind  that  the  average  duration  of  a 
sortie  is  equal  to  2  hours  long. 

Each  trajectory  (a  case)  corresponds  to  a  triple  <kir,X(k,r)>  where  ^  -  is  the  number  of 
sortie,  r  -  is  the  number  of  device  (aircraft),  X(k,r)  -  is  the  vector  of  cumulative  adverse  exposure  at 
the  end  of  the  sortie  number  k.  In  Appendix  A1  the  trajectories  for  more  components  of  adverse 
exposures  development  among  factors  x,{k,r')-X',g{k,r')  are  given. 

Each  trajectory  has  a  final  point  that  corresponds  to  the  status  “  failure”  of  the  device.  Since  the 
number  of  sortie  at  this  point  is  known  and  we  supposed  that  each  sortie  has  2  hours  long,  we  can  map 
each  point  of  trajectory  by  one  more  purposeful  variable,  i.e.  by  variable  T(k,r)  that  has  the  sense  of 

residual  performance  resource.  Indeed,  if  kj^(r)  is  the  number  of  sortie  that  corresponds  to  the  event 
‘failure”  and  At  is  the  duration  of  each  sortie  then 

t{k,r)  =  kj{r)y.At-k{r)y.At=[kf{r)-k{r)\At.  (2.1) 

Formula  (2.1)  makes  it  possible  to  map  each  point  of  all  trajectories  of  failure  development  by 
the  value  of  residual  performance  resource  T(k,r).  On  the  one  hand,  this  mapping  extends  the 
statistical  data  in  the  way  that  makes  possible  to  design  a  regression  model  (see  Section  3).  On  the 
other  hand,  this  mapping  makes  it  possible  to  obtain  one  more  sensitive  parameter  for  prognosis  the 
probability  of  failure  at  a  given  time  in  the  future.  In  the  fig.2.5  -  fig.2.8  we  depicted  the  trajectories 
of  failure  development  in  terms  of  the  adverse  factors  but  used  the  horizontal  axis  marked  by  the 
variable  T(k,r).  In  Appendix  A1  such  trajectories  are  given  for  more  components  of  adverse 
exposures  development. 

2.6.  Interpretation  of  Simulated  Data 

Representation  of  statistical  data  in  the  form  of  the  multitude  of  trajectories  of  failure 
development  gives  a  new  insight  on  the  data  interpretation.  Formally,  all  points  of  any  trajectory 
where  k(r)<kf(r)  correspond  to  the  device  status  “no  failure”.  Nevertheless,  it  is  intuitively  clear 

that  the  points  that  are  proximate  to  the  points  X{kf,r)  form  the  “border-line”  class  of  device 
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2.Dynainic  Data  Model  of  Failure  Development 


Fig.2.2.  Realization  of  trajectories  of  development  of  adverse 
exposure  as  the  functions  of  the  number  of  aircraft  sortie 


Fig.2.4.  Realization  of  trajectories  of  development  of  adverse 
exposure  X,^  as  the  functions  of  the  number  of  aircraft  sortie 


2.Dynainic  Data  Model  of  Failure  Development 


X5 


Fig.2.5.  Realization  of  trajectories  of  development  of  adverse 
exposure  X;  as  the  functions  of  residxoal  performance  resource 


Fig.2.6.  Realization  of  trajectories  of  development  of  adverse 
exposure  X,  as  the  functions  of  residual  performance  resource 
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2.Dynainic  Data  Model  of  Failure  Development 


X12 


Fig.2.7.  Realization  of  trajectories  of  development  of  adverse 
exposure  X12  as  the  functions  of  residual  performance  resource 


Fig.2.8.  Realization  of  trajectories  of  development  of  adverse 
exposure  as  the  functions  of  residual  performance  resource 
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2.Dynamic  Data  Model  of  Failure  Development 

status  which  has  to  be  learnt  together  with  traditionally  considered  binary  statuses  “no 
failure” -“failure”. 

This  consideration  is  the  reason  of  new  database  model  used  below  for  numerical  validation  of 
the  developed  mathematical  model  and  algorithms  of  healthcare  assessment  system  design.  We  used 
statistical  database  in  which  every  case  is  assigned  by  a  value  of  failure  status  from  the  set  {“no 
failure”,  “border-line”,  “failure” }={-!,  0,  1}. 

In  Appendix  A2  the  generated  statistical  database  is  given.  It  contains  200  cases  X(k,r)  each 
extended  by  the  value  of  residual  performance  resource  T(k,r) .  These  cases  are  used  for  knowledge 
engineering  procedures  to  develop  classification  rules.  200  more  cases  of  this  database  are  used  for 
testing  of  the  resulting  classification  rules. 

Let  us  note  that  cases  included  in  database  were  chosen  randomly  among  the  points  of  all 
trajectories  generated  via  DDM.  As  a  result,  we  design  a  database  of  the  same  representation  as  one 
used  in  Interim  Report  [IR-98]  The  main  distinction  of  these  databases  is  that  the  latter  used  3-valued 
interpretation  of  the  status  of  devices  performance.  Let  us  consider  this  question  in  more  details. 

Together  with  conventionally  used  values  of  performance  status  “no  failure”,  “failure”,  in  this 
report  we  use  one  more  value  of  status,  i.e.  the  value  “border-line”.  This  status  is  determined  as 
follows: 

If  X ^{“failure  ”}  and  0<  Xjo  ^  r  then  Xe{“ border-line  ”},  (2.2) 

where  (“failure”}  -  is  the  multitude  of  cases  of  database  having  status  of  performance  “failure  and 
(“border-line”)  -  is  the  multitude  of  all  cases  that  have  status  of  performance  “border-line”-,  the 
value  f  -  is  a  threshold  of  residual  performance  resource  admissible  to  believe  that  status  of  device 
performance  is  “no  failure”.  Below  we  accept  the  value  of  threshold  as  follows:  T  —20.  This  choice 
was  conditioned  by  precision  of  regression  model  of  the  residual  performance  resource  forecasting 
(see  Section  3). 

Numerical  results  obtained  via  DDM  developed  in  this  section  turned  out  convenient  to  test  the 
basic  algorithms  of  the  proposed  prognostic  model  numerically  (see  Section  4). 
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3.  Regression  Model  for  Residual  Performance 

Resource  Assessment 


3.1.  Problem  Statement  and  Traditional  Approach 

In  this  section  we  consider  the  task  of  regression  model  design  for  residual  resource 
assessment.  We  suppose  that  initial  information  needed  for  design  of  the  above  model  is  given  in  Ae 
form  of  the  statistical  database  of  trajectories  of  failure  development.  The  samples  of  such  trajectories 
are  depicted  in  the  fig.2.4-fig.2.8  (Also  see 

The  simplest  formal  problem  statement  of  regression  model  design  is  as  follows.  It  is  given 
database  of  cases  <X,  r  > ,  where  Z-  is  vector  of  cumulative  exposures  of  adverse  factors,  r  -  is  the 
value  of  residual  performance  resource.  The  task  is  to  design  a  function  z  —  f  (^X) .  Traditional 
regression  model  might  be  designed  in  the  following  way.  Let  us  introduce  the  extended  vector 
X^  =[X^ ,71-  While  having  database,  we  are  able  to  calculate  the  vector  of  mathematical 
expectation  and  covariance  matrix  of  the  vector  X^  =  [X^  .  Let  us  represent  them  as  follows: 

M[X]  =  M{X\Tf  ={X\zf ,  (3.1) 


W{X,X]  = 


(3.2) 


‘W[X,X]  W[X,zi 
W{z,X^]  W{z,t] 

where  X,  f  -  are  mathematical  expectations  of  the  respective  variables  and  fV^*,  *J  are  covariance 
matrices  of  the  variables  within  squared  brackets 

Dimensions  of  the  blocks  in  matrix  JV[X,X]  correspond  to  the  following  scheme: 


nxn  nxl 

'19x19 

19x1 

Ixn  1x1 

1x19 

1x1 

In  terms  of  accepted  denotations  the  regression  equation  r  =  /(X)  is  as  follows: 

f(X)  =  M{rlX)  =  t+W{z,XW{X,XY[X-X] .  (3.3) 

But  this  equation  is  not  appropriate  in  practice  because  it  doesn’t  depend  on  such  a  sensitive 
variable  as  number  of  sortie  k.  DDM  described  in  the  previous  section  makes  it  possible  to  design  the 
more  sophisticated  and  precise  regression  model. 

3.2.  DDM-based  Regression  Model 

Let  us  consider  database  generated  by  DDM.  For  this  database  we  are  able  to  map  each  point 
of  trajectory  by  the  argument  value  “number  of  sortie”,  and  to  calculate  the  value  of  variable  “residual 
performance  resource”.  This  means  that  we  may  design  regression  model  in  the  following  form  ([Rao- 
71],  [Anderson-60]): 

f  (Z,  A:)  =  M[T{k)IX(k))  =  z{k)  +  ir[T(A:),Z(A:)MZ(A:),X(A:)]-'[^(A:)  -  J(A:)]  (3.4) 

Comparing  to  the  previous  case  (3.3),  the  peculiarity  of  the  regression  model  (3.4)  is  that  all 
statistics  (mathematical  expectations  and  matrices  of  covariance)  are  computed  on  the  basis  of 
sampling  mapped  by  the  same  value  of  the  number  of  sortie.  Algorithm  of  their  calculations  are  well 
known  ([Rao-71],  [Anderson-60]). 

The  next  step  of  regression  model  improvement  is  the  use  of  history  of  failure  development  and 
autoregression.  Actually,  for  each  sortie  number  A:  of  an  aircraft  r  the  history  {X(l,r),  X(2,r),..., 
X(k,r)}  is  known.  This  history  may  be  used  to  improve  regression  model  in  the  following  way. 
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Let  us  choose  an  integer  value  m.  We  say  that  regression  model  has  the  depth  of  memory  equal 
to  m  if  for  any  sortie  number  k  the  values  of  X(k-in),  X(k-in+l),  ...,X(k-l),  X(k)  )  are  used  for  the 
regression  model  design.  Let  us  show  how  the  regression  model  of  the  depth  m  may  be  designed. 

Let  us  introduce  denotation  X{k)=<  X(Jc),T{k)>  that  is  the  vector  of  adverse  factors 
extended  by  the  value  of  residual  performance  resource  at  the  end  of  a  sortie  of  number  k.  Let  us 
calculate  the  following  covariance  matrices  associated  with  the  vectors  X(k-m), 

Xik-m  +  l),...,X(k): 

W(X(k-m),X(k-m)],  W(X(k-m)^(k-m+l)] . W(X(k-m),X(ky\; 

W(M-m+l)^(k-m+2)];,...,W(X(k  -m  +  l),X(k)] 


W(X(k%X(k)] 

To  reduce  the  above  task  to  the  standard  form  (3.4)  of  the  regression  model  design  let  us 
compose  the  following  block-wise  matrices: 

WiXik-ni^fCik-rri^]  ]^X{k-ni)^{k-m+l)\  ...  W[X(f-n^{k)\ 

WiX(]c-m+r)f({k-ni)\  W\X{k-m+l)X{k-m+l)\  ...  W[_X{k-m+l)X{k)\ 

WlX(]e,rti)]=  (3-5) 

W[X{k)X{k-nii\  mX(Ji)M-m+l)\  ...  W[_X(f)JXm 


m,m,X,T]=^X{k-m),‘tik)'\  W[X{k-m+I),'c{k)  ...  W[X{k),T{k)i  (3.6) 


and 

W[k,m,T^ 


WiXiKm)]  W{k,m,X,T^ 
W'^[k,m,X,T\  <7\r(k)\ 


(3.7) 


As  well  let  us  suppose  that  all  mathematical  expectations  used  below  are  calculated. 

Let  we  know  the  history  of  adverse  exposures  along  the  trajectory  of  failure  development  X(k- 
m),  X(k-m+I),  ...,X(k).  Let  us  denote  this  history  as  follows: 

Xik,m)=[X^ik  -  m)X{k  -m  +  l),...X(k)f ,  (3.8) 


X(k,m)  =  M[X{k,m)].  (3.9) 

While  utilizing  the  formula  like  (3.4)  for  matrices  W[X(k,m)J  (3.5),  W[k,m,X,T]  (3.6)  and 
mathematical  expectation  X{k,m)  (3.9),  we  can  to  constitute  the  following  equation  for  assessment 
of  the  residual  performance  resource: 

r(it/X(A:,/«))=f(^)  +  pr'’[^,/n,X,rMX(^,/«)r'[X(fc,7w)-X(^,w)].  (3.10) 


3.3.  Numerical  results 

We  have  investigated  numerically  what  parameters  of  the  regression  procedure  are  sensitive 
regarding  to  the  precision  of  the  assessed  residual  performance  resource.  There  were  considered  two 
of  them,  i.e.  m  -  is  the  depth  of  memory,  and  5  -  is  the  interval  between  two  points  of  trajectory 
(“step")  involved  in  regression  model.  The  sense  of  the  memory  depth  m  has  been  explained  already. 
The  sense  of  the  variable  s  is  as  follows.  Let  us  use  memory  depth  m=2.  It  means  that  to  assess 
residual  performance  recourse  we  use  three  values  of  history  of  adverse  exposures  development,  i.e. 


18 


3.  Regression  Model  for  Residual  PerformanceResource  Assessment 


Xik2)  and  Xikj).  Difference  between  two  sequential  values  of  variable  “number  of 
sortie  ”  we  mean  as  step  of  regression  procedure,  i.e.  s=k2  —ki=k^  —k2. 

It  is  clear  that  the  more  value  of  variable  m  the  more  computational  complexity  of  regression 
procedure.  Actually,  increase  of  m  entails  remarkable  increase  of  dimension  of  matrices  in  the 
equation  (3.10).  We  investigated  regression  model  precision  for  m=0,  1,  2  for  different  values  of 
variable  s.  The  results  are  given  in  the  fig.3. 1  -fig.3.8. 

As  a  conclusion  it  was  adjusted  that  the  most  appropriate  value  of  m  is  equal  to  2  and  value  of  ^ 
is  about  (5-10).  Decrease  of  5  entails  increase  of  noise  but  increase  of  it  lead  to  decrease  of  precision. 
Of  course,  this  conclusion  is  valid  for  concrete  data  model  and  sampling  size  for  considered  applied 
task. 

The  more  important  and  traditionally  discussed  problem  is  what  adverse  factors  have  the  most 
noticeable  impact  on  the  value  of  residual  performance  resource.  Is  it  possible  to  diminish  the  size  of 
the  vector  of  adverse  factors  to  minimum,  for  example,  to  4-5  preserving  the  precision  of  the 
regression  model  within  required  limits?  This  task  may  be  solved  within  the  considered  frameworks, 
but  it  takes  to  attract  special  approaches  (not  only  statistical  ones)  and  is  a  subject  of  special  research. 
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Values  of  statistics 


3.  Regression  Model  for  Residual  PerformanceResource  Assessment 


m=0 


Fig.3.1.  Realizations  of  error  of  residual  performance  resource  assessment  for  m—0. 


average  standard  deviation  - - Minimum  - Maximun 


Fig.3.2.  Properties  of  regression  model  (memory  depth  m=0). 

Horizontal  axis  corresponds  to  the  number  of  sortie  in  which  the  residual  performance  resource 
is  assessed.  "Average”  is  the  mathematical  expectation  of  the  error  of  residual  performance 
resource  assessment.  “Variance”  is  the  standard  deviation  of  the  error.  “Max”  and  “Min  ”  values 
correspond  to  the  maximal  and  minimal  values  of  the  error  over  the  tested  trajectories  for  25 
units. 
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3.  Regression  Model  for  Residual  PerformanceResource  Assessment 


Fig.3.3.  Properties  of  regression  model  (memory  depth  »j=/and  step  =3  sorties). 

Horizontal  axis  corresponds  to  the  number  of  sortie  in  which  the  residual  performance  resource 
is  assessed.  "Average”  is  the  mathematical  expectation  of  the  error  of  residual  performance 
resource  assessment.  “Variance”  is  the  standard  deviation  of  the  error.  “Max”  and  “Min  ”  values 
correspond  to  the  maximal  and  minimal  values  of  the  error  over  the  tested  trajectories  for  25 


70  -  80 


-Average - Variance _ Min  -  Max  | 


Fig.3.4.  Properties  of  regression  model  (memory  depth  m=/and  step  =10  sorties). 

Horizontal  axis  corresponds  to  the  nximber  of  sortie  in  which  the  residual  performance  resource 
is  assessed.  “Average”  is  the  mathematical  expectation  of  the  error  of  residual  performance 
resource  assessment.  “Variance”  is  the  standard  deviation  of  the  error.  “Max”  and  "Mm”  values 
correspond  to  the  maximal  and  minimal  values  of  the  error  over  the  tested  trajectories  for  25 


3.  Regression  Model  for  Residual  PerformanceResource  Assessment 


Fig.3.5.  Properties  of  regression  model  (memory  depth  m=2  and  step  -3  sorties). 

Horizontal  axis  corresponds  to  the  number  of  sortie  in  which  the  residxial  performance  resource  is 
assessed.  “Average”  is  the  mathematical  expectation  of  the  error  of  residual  performance  resource 
assessment.  "Foriance"  is  the  standard  deviation  of  the  error.  “Max”  and  "Mw  "  values  correspond 
to  the  maximal  and  minimal  values  of  the  error  over  the  tested  trajectories  for  25  units. 


Fig.3.6.  Properties  of  regression  model  (memory  depth  m=2  and  step  =10  sorties). 

Horizontal  axis  corresponds  to  the  number  of  sortie  in  which  the  residual  performance  resource 
is  assessed.  “Average”  is  the  mathematical  expectation  of  the  error  of  residual  performance 
resource  assessment.  “Variance  ”  is  the  standard  deviation  of  the  error.  “Max  "  and  “Min  "  values 
correspond  to  the  maximal  and  minimal  values  of  the  error  over  the  tested  trajectories  for  25 
units. 
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4.  Knowledge  Discovery  from  Statistical  Data  Base  for 

Health  Assessment  System  Design 

4.1.  Outline  of  the  Technology  of  Knowledge  Discovery  from  Statistical  Data  Base 

This  section  is  devoted  to  the  outline  of  the  developed  approach  that  forms  a  Knowledge 
Engineering  (Knowledge  Discovery  from  Data  (BCDD))  technology  for  the  prognostic  model 
development.  Main  ideas  of  the  technology  were  described  in  the  Interim  Report  [IR-98].  In  this 
section  we  in  brief  repeat  the  technology  description,  include  some  new  results  obtained  in  the  second 
and  third  phases  of  research  and  demonstrate  the  technology  numerically  on  the  basis  of  database 
generated  by  DDM  described  in  Section  2.  It  should  be  noted  that  this  section  is  self-contained  and 
doesn’t  require  to  be  familiar  to  the  contents  of  the  Interim  Report  [IR-98]. 

In  contrast  to  the  data  interpretation  accepted  in  the  [IR-98],  below  we  consider  data  divided 
into  three  clusters,  i.e. 

•  data  records  interpreted  as  “normal performance  ”  (“no  failure  ”); 

•  data  records  interpreted  as  “border-line  performance"  (the  cases  themselves  correspond  to  “no 
failure”  but  residual  performance  resource  is  less  than  given  (chosen)  threshold),  and 

•  data  records  interpreted  definitely  as  “failure 

Let  us  denote  these  clusters  of  data  records  (statuses  of  performance)  as  “-1”,  “0”  and  “1" 
respectively. 

The  KDD  process  aims  at  the  development  of  a  model  and  a  model-based  prognostic  procedure 
that  provide  high  quality  of  classification  problem  solving  and  precise  prognosis  of  the  probability  of 
failure.  The  latter  is  utilized  for  estimating  the  probability  of  failure  of  a  particular  module  at  a  given 
time  in  the  future,  say,  during  a  forthcoming  sortie,  on  the  basis  of  the  current  "history  of  abuse”  of 
this  module. 

Let  us  remind  that  from  the  Data  Mining  point  of  view,  this  problem  constitutes  the 
classification  task.  A  peculiarity  of  the  numerical  task  considered  below  compared  to  one  considered 
in  the  Interim  Report  [IR-98]  is  as  follows.  In  this  Report  we  consider  the  task  that  deals  with  database 
divided  into  three  clusters  as  far  as  in  [IR-98]  we  have  considered  the  binary-status  task.  This 
peculiarity  necessitates  utilization  of  a  multi-step  decision  making  procedure  according  to  the  meta¬ 
tree  depicted  below  in  the  fig.  4.1.  Let  us  note  that  developed  software  makes  it  possible  to  implement 
search  according  to  this  tree  for  general  case  when  the  number  of  clusters  is  equal  to  an  arbitrary  finite 
integer  value.  Each  step  of  search  according  to  the  above  meta-tree  (within  a  node  of  meta-tree) 
corresponds  to  the  technology  developed  in  the  Interim  Report  [IR-98].  Below  we  describe  the 
upgraded  version  of  this  technology. 

Let  us  stress  the  difference  between  two  notions  used  in  this  report,  i.e.  “decision  tree”  and 
“meta-tree”.  The  latter  aims  at  dividing  statistical  database  into  two  sub-databases  via  assigning  each 
case  the  name  of  meta-class  which  it  belongs  to.  It  is  high-level  procedure.  This  procedure  doesn’t 
results  in  any  classification  predicates.  It  should  be  noticed  that  “meta-tree”  design  is  exclusively  the 
task  of  expert’s  responsibility.  In  contrast,  “decision  tree”  aims  at  designing  classification  predicates 
which  forms  rules  for  separation  a  (meta  -)  cluster  from  another  one  according  to  “meta-tree.  It 
should  be  noticed  that  if  we  deal  with  the  binary  classification  task  then  the  corresponding  meta-tree 
consists  of  the  single  node. 

The  Data  Mining  technology  within  each  node  of  meta-tree  (see,  for  example,  fig.4.1)  in  the 
main  features  corresponds  to  one  developed  in  [IR-98].  Let  us  outline  it  in  brief. 

Decision  tree  design  includes  the  development  of  a  partially  ordered  set  of  nodes  and  a  predicate 
dividing  a  set  of  cases  into  two  subsets.  Consider  creating  i-th  node  having  as  input  argument  a  subset 
of  training  data  Sj  which  contains  cases  of  two  clusters  (for  example,  'border-line"  and  "failure".  In 
general  case  we  deal  with  two  meta-clusters. ).  The  goal  of  the  procedure  associated  with  the  node  is 
dividing  the  set  of  cases  into  two  clusters  in  the  possibly  best  way  assigning  to  the  node  a 
separation  rule  and  corresponding  predicate.  This  procedure  design  consists  of  a  number  of  steps.  We 
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consider  it  below  for  the  node  of  meta-tree  corresponding  to  the  subset  of  cases  "'Status  performance” 
e  {“0  ”  V  “I  ”}={“border-Ime  ”,  “failure  ”}. 

1.  Ranking  of  two-dimensional  subspaces  (“2-d  subspaces”)  of  the  entire  factor  space  in  accordance 
with  the  chosen  criterion  of  informativity  calculated  over  training  data  .  It  is  available  for  a 

developer  to  use  a  number  of  heuristic  criterions  of  informativity.  Selection  of  the  most  informative  2- 

d  subspaces  is  very  important  task 
and  is  solved  via  user-computer 
interactions.  User-friendly 

interface  makes  this  task  simple 
for  user  (see  Subsection  4.3 
below). 

2.  Visualization  of  the 
projections  of  both  clusters  of 
training  data  iS,  onto  the  selected 
2-d  subspace  and  providing  a 
developer  with  the  opportunity  to 
adjust  the  separation  rule  manually 
using  a  computer  graphical 
interface.  This  procedure  is  a  ^ 
point  of  the  technology.  It  makes 
possible  to  design  separation  rules 
of  arbitraiy  shape  manually.  Note, 
that  the  developed  software  makes  it  possible  to  draw  manually  a  non-linear  and  even  non-convex 
separation  rules  and  automated  generation  of  the  associated  predicates. 

3.  Division  of  the  experimental  data  5,  into  two  non-overlapping  subsets  Sf  ,  SJ ,  5,  =S^  u  S’r .  The 

subset  S*  contains  the  cases  of  5,  such  that  the  predicate  obtained  at  the  previous  step  is  "/rue  "over 

all  cases  of  the  set  ,  and  the  subset  SJ  contains  the  cases  of  5,  over  which  it  is  'false".  Based  on 

an  additional  criterion,  each  of  two  subsets  S*  and  Sj  is  classified  as  a  leaf  Rj  of  the  decision  tree 

under  development  or  as  its  new  intermediate  node.  Decision  tree  development  is  ended  if  it  does 
not  contain  intermediate  nodes  that  were  not  processed  in  accordance  with  the  procedures 
described  above  in  the  steps  1-3. 

4.  Each  leaf  Rj  of  the  decision  tree  is  mapped  to  the  predicate  Pj ,  which  is  constituted  as  conjunction 
of  predicates  met  along  the  way  from  root  node  up  to  the  leaf  Rj .  In  addition,  each  leaf  Rj  is 
mapped  to  a  subset  of  cases  of  experimental  data  for  which  predicate  Pj  is  "true".  Note  that  each  leaf 
Rj  may  contain  cases  of  experimental  data  belonging  to  both  clusters  (for  example,  “border-line” 
and  "failure")  or  only  to  one  of  them.* 

Let  r  decision  trees  are  developed.  Then  the  following  task  is  performed  to  constitute  the  decision¬ 
making  procedure. 

5.  For  each  decision  tree  number  k,  (k=l,  2,...,  r),  definition  of  the  probabilistic  space  in  which  every 
leaf  R^j  of  decision  tree  constitutes  an  elementary  event.  Each  elementary  event  i?*  is  characterized 
by  a  confidence  interval  probabilities  pj.(X/”border-line”)  and  p (X/"failure”)  estimate  defined 
empirically  using  training  and  testing  data  for  any  vector  of  factors  Xe  iJ* . 


‘  As  a  particular  case,  we  may  consider  a  decision  tree  that  consists  only  of  a  root  node. 
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The  next  two  steps  correspond  to  decision  making  procedure  itself  and  are  utilized  for  model-based 
estimating  (forecasting)  the  probability  of  failure  of  a  particular  module  at  a  given  time  on  the  basis  of  its 
current  "history  of  abuse". 

6.  For  given  vector  of  factors  X,  definition  of  the  leaf  Rj  to  which  vector  of  factors  X  belongs  in  each 

decision  tree  of  number  k  and  hence,  definition  of  the  values  of  probabilities  (X/»border-line») 

and  Pi  (X/' failure")  (see  step  5)  obtained  by  each  decision  tree.  Joint  processing  of  the  above 
probabilities  on  the  basis  of  so  called  "Algebraic  Bayes’  Network"  (see  [IR-98],  [Gorodetski-92], 
[Gorodetski  at  al-97])  and  calculation  of  the  final  values  of  probabilities  p(X/"border-lme")  and 
p(X/' failure"). 

7.  Definition  of  the  decision  making  scheme  based  on  Bayes’  approach  [Skormin  et  all-97].  This 
procedure  aims  at  calculation  of  the  probability  of  failure  p("failure"/X)  of  the  device  affected  by  the 
given  cumulative  exposures  of  factors  X 

8.  Testing  the  develop^  prognostic  model  and  model-based  prognostic  procedure  by  using  an  array  of 
both  training  and  examination  data  to  assess  properties  of  the  model  and  the  decision-making 
procedure. 

Of  course,  the  opportunity  to  utilize  our  procedure  for  assessing  the  probability  of  failure  during  the 
future  cycle  of  the  module  operation  depends  upon  the  ability  to  forecast  exposure  of  adverse  factors  at 
the  time  in  question.  The  task  of  development  of  the  appropriate  means  of  forecasting  was  developed  in 
[Popyack-98]. 

Note  once  more  that  the  first  five  steps  result  in  the  definition  of  a  prognostic  model.  The  next  two 
steps  constitute  a  model-based  decision-making  procedure.  The  last  step  is  aimed  at  validation  of  the 
resulting  model  and  model-based  decision-making  procedure. 

The  mathematical  model  discussed  herein  is  applicable  to  solving  such  practical  prognostic  related 
problems  as; 

•  ranking  particular  environmental  conditions  as  factors  responsible  for  general  and  particular  types  of 
failures, 

•  determination  of  particular  groups  of  environmental  conditions  and  assessment  of  their  combined 
effects  on  failures  in  general  and  on  particular  types  of  failures, 

•  tracking  the  dynamics  of  development  of  cluster  models  and  their  statistical  characteristics  in  the 
process  of  obtaining  new  experimental  data, 

•  justification  of  the  development  of  devices  protecting  avionics  fi-om  adverse  environmental  conditions, 

•  development  of  the  recommendations  on  the  avoidance  of  the  combined  effects  of  adverse  conditions. 
This  could  be  performed  in  real-time  on  the  board  of  an  aircraft  or  spacecraft. 

Below  we  consider  the  above-described  steps  of  the  proposed  technology  in  more  details. 

4.2.  Heuristic  Selection  of  Informative  Subspaces.  Informativity  Criteria 

It  is  well  known  that  learning  procedures  aimed  at  extracting  knowledge  from  data  are 
computationally  intensive.  To  decrease  the  amount  of  computations,  researchers  often  use  heuristic 
and  intuitive  notions  such  as  "informativity",  "similarity",  etc.  Formally  specified,  heuristic  and  intuitive 
notions  are  always  problem-  or  domain-oriented.  In  our  approach  we  use  the  intuitive  notion  of 
informativity  to  r^  the  subspaces  of  factors  (features)  and  to  select  a  more  compressed  specification  of 
experimental  data  for  further  processing  on  this  basis.  We  have  investigated  a  number  of  formal 
specifications  of  informativity  criteria.  All  of  them  can  be  interpreted  as  mean  square  normalized  and, 
possibly,  weighted  distance  between  two  clusters  of  statistical  data.  For  large  amounts  of  data  the 
same  criteria  can  be  specified  in  terms  of  corresponding  statistics  assessed  over  data  empirically. 

Herein  and  below  we  use  the  following  notations:  X  =  -  is  a  vector  of  factors 

representing  cumulative  exposure  to  adverse  conditions  in  hours;  Qe{“—1”,  “0”,  -  is  an 

integer  variable  symbolizing  the  output  discrete  event  (  "normal  operation  of  the  device"  corresponds 
to  Q=”0”  corresponds  to  the  ‘‘border-line’’  status  of  the  device  performance  and  "the 

device  failed"  corresponds  to  Q=”l  observed  data  are  indexed  by  the  symbols  r,  s;  the  number  of 
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cases  associated  with  the  node  "Status  of  performance”  ={“0”  v  “1  ”}  is  equal  to  N,  N=Kf^-^K^, 
where  are  the  total  number  of  realizations  of  cluster  ”0"  and  cluster  "1"  respectively. 

Therefore,  experimental  database  consists  of  the  subsets  of  realizations  of  cluster  "0"  marked  by 
superscript  "0",  for  example,  (r)} ,  and  realizations  of  cluster  "i"  marked  by 

superscript  "7"  for  example,  {x*  (5),  X2(j),...,x‘(j')}.  Additional  notations  will  be  introduced  later. 


The  informativity  criteria  were  selected  as  the  most  appropriate  due  to 

•  their  adequacy  to  experts’  intuitive  interpretation  of  the  subspace  informativity, 

•  complexity  of  the  subspace  ranking  task  and 

•  on  the  basis  of  numerical  experiments  [IR-98]. 

Criteria  (4.1)  -  (4.3)  below  correspond  to  a  two-dimensional  case  but  they  also  can  be  defined  in  a 
subspace  of  arbitrary  dimension^. 


-H(Ax;’‘)Va?+(Ax;’')VcT^ 


(4.1) 

(4.2) 


where  O/,  -  are  standard  deviations  of  variables  x,  and  x^  estimated  over  the  entire  range  of 

experimental  data,  (Ax®’*)^,(Ac®’‘)^  are  squared  distances  between  mathematical  expectations  of 


vectors  of  factors  within  clusters  "0"  and  "1 "  respectively  in  die  subspace  comprising  factors  X; ,  x^  . 


where  a® ,  aj  -  are  weights  assigned  to  cases  (realizations)  number  r  and  number  s  of  clusters  "0" 
and  "/"respectively.  Weights  a®  and  aj.  are  calculated  according  to  the  algorithm  given  below: 

Ax,  =(x,®-x/)/ct,.  Ax,  =(x® -x,‘)/a,,  b  =  ^(Ax,y  +(Ax,)^ 
e,,  =<  Ax,  /  6,  Ax,  /  6  >=<  e,  ,e,  > 

For  all  cases  of  cluster  «1»  do  (r=l,2,...,  KJ 
dr=\  +e,(x;(r)-x,®)/<7,  | 

dl=\  e,(x;(r)-x/)/«7,  -he,(x*(r)-x*)/o-,  | 

,\\ifdl>dl 

[0,  //t7®<f7^ 

For  all  cases  of  cluster  «0»  do  (s=l,2,...,  KJ 
ds=\  e,{x°(s)-x!)la,  -Ke,(x,®(5)-x,®)/tr,  | 
dl=\  ei(x°(s)-xj)/<r,  -f-e,(x,®(5)-xJ)/o-,  | 

,_(lifdl>d!, 

|o, 

Detailed  explanation  of  the  sense  of  weights  a®  and  a\  was  given  in  [IR-98]. 


^  It  was  told  that  due  to  the  use  of  the  notion  of  meta-tree  we  reduce  the  general  case  of  classification  task  to  the 
case  of  two  clusters.  That  is  why  we  consider  below  a  binary  classification  and  for  numerical  demonstration  we 
use  cases  of  clusters  “0"  and  “1”  since  separation  of  these  clusters  is  more  difficult  task. 
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Note,  that  criterion  (4.2)  is  a  statistical  equivalent  of  criterion  (4.1)  and  is  intended  to  be  used 
for  large  amounts  of  experimental  data.  Criteria  (4.1)-(4.2)  are  additive  and  this  property  makes 
possible  to  design  an  efficient  optimi2ation  procedure  of  subspace  ranking  according  to  their 
informativity  for  any  arbitrary  dimension.  Corresponding  algorithm  was  developed  and  described  in 
[IR-98].  In  the  fig.4.2-fig.4.3  the  samples  of  printouts  of  spaces  ordering  obtained  on  the  basis  of 
informativity  criteria  (4.1)  and  (4.3)  respectively  are  given. 

In  [IR-98]  the  computational  complexity  of  the  algorithms  of  calculation  of  criteria  (4.1)-(4.3) 
was  adjusted  as  well. 

4.3.  Visual  Design  of  Arbitrary  Classification  Predicates  as  a  Step  Towards  a  New 
Technology  of  Classification  Model  Design 

According  to  the  accepted  methodology  of  prognostic  model  development  based  on  numerical 
experimental  data  at  the  next  step  so-called  classification  predicates  ([Skormin  at  al-97],  [Skormin  at 
al-99],  [IR-98])  are  developed.  Actually,  the  meaning  of  classification  predicates  introduced  below  is 
twofold.  First,  they  form  a  basis  for  the  definition  of  prognostic  rules.  Second,  one  can  consider 
classification  predicates  as  a  feature  that  represents  experimental  data  on  a  binary  scale  instead  of  the 
original  numeric  scale.  The  latter  view  is  very  useful  from  general  Data  Mining  point  of  view:  since 
the  original  experimental  data  contains  both  continuous  and  discrete  columns  (factors),  the  utilization 
of  classification  predicates  facilitates  the  transformation  of  all  columns  of  the  original  experimental 
data  to  a  discrete  format.  This  transformation  is  typical  in  performing  Data  Mining  and  KDD  tasks  but 
approach  considered  below  is  new  one  and  possess  a  number  of  veiy  fruitful  advantages  outlined  in 
the  following  sections. 

Conceptually,  a  classification  predicate  is  viewed  as  the  predicate  associated  with  a 
separation  rule  designed  within  a  subspace  of  low  dimension.  Let  us  recall  that  in  this  study  in  order  to 
facilitate  visualization,  we  consider  only  2-d  subspaces. 

Let  us  consider  projection  of  two  original  clusters  of  experimental  data  on  a  2-d  subspace  of 
factors,  i.e.  onto  a  plane  as  shown  in  fig.  4.4.  Assume  that  a  software  tool  allows  a  user  to  draw  linear 
separation  bounds  that  are  perceived  as  good  or  optimal.  Assume  that  if  a  user  draws  a  linear 
separation  bound  the  software  tool  automatically  generates  the  linear  equation  f(Xi,Xg)  of  the 
corresponding  bound  and  defines  the  appropriate  predicate  as  follows: 

f  if  /*  (x, ,  X,  )  >0  then  is  true 
I  if  /^(X;,JC^)<0thenP^  is  false. 

Geometrically,  (4.4)  implies  that  in  a  half-plane  <x,,Xg>  predicate  is  true  and  it  is  false  in 

the  alternative  half-plane.  It  is  expected  that  a  user  has  the  ability  to  draw  a  number  of  linear 
separation  bounds  and  corresponding  software  tool  automatically  generates  equations  yj(x/,x^),... 

...,f„(x,,Xg)  and  associated  predicates  Pj,  P2,...,P„.  Each  predicate  divides  plane  in  two 

half-planes.  Generally,  plane  <x„Xg>  will  be  divided  into  no  more  than  2”  convex  regions 

Lj,  /  =  iy2,...y2'"  that  do  not  overlap  and  in  combination  cover  the  entire  subspace  <x,,Xg>.  Within 

each  region  Lf,  /=7,2,...,2”  exactly  one  conjunction  of  the  length  m  of  predicates  Pj,  P2,...,P„ 

taken  with  and  without  negation  is  true.  Hence,  each  region  L^,  i  =  7,2,... ,2"  is  defined  formally  by  a 

conjunction  of  predicates  (4.4),  an  arbitrary  combination  of  such  regions  is  defined  by  disjunction  of 
above-mentioned  conjunctions.  One  should  understand  that  if  a  software  tool  provides  a  user  with  the 
capability  to  define  a  number  of  visually-justified  linear  separation  bounds  and  associated  predicates, 
then  the  user  is  capable  to  design  a  very  wide  class  of  separation  rules.  This  class  contains  linear  and 
polygon-like  bounds  which  may  correspond  to  an  arbitrary  convex  and  non-convex  regions.  The  latter 
may  be  obtained  as  a  combination  of  convex  regions  of  truth  of  some  above  mentioned  conjunctions. 
To  illustrate  this  concept  let  us  consider  the  situation  depicted  in  fig.  4.5. 
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Fig.  4.2.  Printouts  of  histogram  and  list  of  2-dimensional  subspaces 
ordered  according  to  (4.1)  measure  of  informativity. 
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Fig.  4.3.  Printouts  of  histogram  and  list  of  2-dimensional  subspaces 
ordered  according  to  (4.3)  measure  of  informativity. 
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Fig.  4.4.  Printouts  of  (1)  visual  interface  for  drawing  linear  separation  boundary  (top 

picture),  (2)  automatically  generated  classification  predicate  (middle  picture)  and 
(3)  results  of  assessment  of  quality  (probabilistic  properties)  of  the  generated 
classification  predicate  (bottom  picture).  Cases  of  different  clusters  are  denoted  by 
signs  of  different  colors. 
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Fig.  4.5.  Printouts  of  (1)  visual  interface  for  drawing  polygon-like  separation  boundary 
(top  picture),  (2)  automatically  generated  non-linear  classification  predicate 
(middle  picture)  and  (3)  results  of  assessment  of  quality  (probabilistic  properties) 
of  the  generated  classification  predicate  (bottom  picture).  Cases  of  different 
clusters  are  denoted  by  signs  of  different  colors. 


4.  Knowledge  Discovery  from  Statistical  Data  Base  forHealth  Assessment  System  Design 


Assume  that  a  user  defines  three  linear  separation  bounds  by  visualizing  the  computer-generated 
clustering  pattern  in  the  plane  <Xu,  Xi5>.  The  predicates  P,  and  P2  associated  with  these  bounds  are 
represented  formally  as  follows: 

Pj  =  (0.226 Xj,  +  0.947 Xjs  -  29.52  >  0) , 

P2  =  (-0.256X^1  -I-  0.967 Xjs  - 14.79  >  0) . 

Both  of  them  are  assigned  by  value  of  "true”  within  the  half-plane  located  above  from  the 
corresponding  linear  bounds.  Hence,  the  white  region  corresponds  to  the  truth  domain  of  both 
predicates  P,  and  •  The  green  colored  region  is  non-linear  and  non-convex  and  is  specified  by  the 
following  logic  formula  given  over  predicates  Pj  and  : 

CP,-hP,&PJv(P,&-^PM^Pl&^P2) 


The  separation  bound  and  the  corresponding  predicate  CP  is  designed  usually  to  separate  cases 
of  two  clusters  as  good  as  possible.  The  last  requirement  can  be  specified  formally  as  follows: 


N J1>  MjQ  ,  MqQ  >  NqI  , 


(4.5) 


where  -  is  the  number  of  realizations  of  cluster  "1"  for  which  predicate  CP;  is  assigned  the  value 
’’true";  Nqj  is  the  number  of  realizations  of  cluster  "1"  for  which  predicate  CP,  is  assigned  the  value 
"false”-,  M,g  is  the  number  of  realizations  of  cluster  ”0"  for  which  predicate  CP,  is  assigned  the 
value  "true”,  and  Mgg  is  the  number  of  realizations  of  cluster  "0”  for  which  predicate  CP,  is  assigned 
the  value  "false".  It  is  clear  that  (N „  +  Mgo)  realizations  of  experimental  data  are  correctly  classified 
by  predicate  CP  ,  and  (M,^  +  Ng,)  realizations  of  data  are  classified  by  predicate  CP,  erroneously. 
For  example,  these  numbers  for  predicate  CP,  are  (see  fig.4.5): 

N  ,,=77,  M  ,g—0  ,  N  g,=0,  Mgg—3. 


Definition  1.  Predicate  that  meets  condition  (4.4)  and  inequalities  (4.5)  is  a  classification  predicate. 


Definition  1  is  non-formal  and  introduces  the  term  that  is  used  elsewhere. 

Based  on  experimental  data  every  classification  predicate  CP^  ,  k=l,2, ...,  m,  can  be  assigned  a 
number  of  attributes  that  represent  the  quality  of  classification  that  it  is  expected  to  achieve.  Let  us 
consider  empirical  estimates  of  probabilities  of  the  correct  and  erroneous  classifications  of  realizations 
of  experimental  data  represented  as  a  matrix: 


P(CP,)  = 


p,(j/i)  Ml/0) 

Mo/i)  P,(o/o) 


(4.6) 


Note  that  the  first  argument  within  the  brackets  corresponds  to  the  decision  made  by  classification 
predicate,  and  the  second  argument  corresponds  to  the  actual  status  of  the  realization  in  question. 
These  estimates  can  be  calculated  as  follows: 

p,(l/l)=N,,(k)/lN,,(k)+M,g  (k)J.  p,(l/0)=M,g(k)/[N,,(k)+M,g(k)]. 

P,  (0  /  l)=Ng,  (kJ/fMgg  (k)+Ng,  m  Pk  (0/0)=  Mgg  (k)/[N g,  (k)+  M gg  (k)] . 

For  example,  for  classification  predicate  CP,  (see  fig.4.2)  the  above  estimates  are  as  follows: 


p,(lll)=l;  p,(l/0)=0 
p,(0/l)=0;  p,(0/0)=l. 

In  fig.  4.6,  fig.4.8,  fig.4.9  and  fig.4.10  one  can  see  a  number  of  samples  of  graphical  synthesis 
of  separation  rules  and  corresponding  classification  predicates  embedded  in  decision  trees  design  (see 
Subsections  4.4  and  4.5).  They  were  designed  for  the  case  study  of  the  developed  technology 


32 


4.  Knowledge  Discovery  from  Statistical  Data  Base  forHealth  Assessment  System  Design 


considered  below.  Compared  to  the  Interim  Report  [IR-98]  a  peculiarity  of  case  study  is  that  it  is  based 
on  statistical  database  generated  by  DDM  developed  in  Section  2. 

Let  us  note  that  one  more  approach  to  the  classification  predicates  synthesis  was  developed  by 
V.Skormin  [Skormin  at  al-99].  In  contrast  to  the  above  polygon-like  separation  rule  design  that  is 
natively  interactive  he  developed  an  “automatic  procedure  resulting  in  ellipse-based  separation  rules. 
This  approach  is  efficient  enough  but  comparing  to  the  former  approach  it  has  to  be  implemented  in 
the  “same  style”,  i.e.  as  a  procedure  supported  by  interactive  visualizing  software.  This  comparison  is 
a  goal  of  the  future  work  in  the  framework  of  Data  Mining  oriented  research. 

4.4.  Forest  of  Decision  Trees  as  a  Step  Towards  Improving  Quality  of  Classification 

The  general  idea  of  decision  tree  was  outlined  in  Section  4.1.  The  decision  tree  development 
procedure  aims  at  finding  the  non-correlated  informative  subspaces  over  the  training  subset  of 
experimental  data.  Let  us  explain  the  term  "correlation"  within  the  context  of  the  Report. 

Let  CP^  and  CPj  be  the  classification  predicates  associated  with  a  selected  pair  of  the  most 
informative  subspaces.  Let  each  of  them  divide  the  entire  set  of  cases  S  into  two  subsets,  i.e.  into 
subspaces  {Sl,  S^}  and  into  subspaces  respectively.  It  may  turn  out  that  Sj»Sj  and  If 

such  non-formal  equalities  are  held  then  die  second  informative  subspace  is  not  able  to  improve 
remarkably  the  classification  procedure.  Hence,  the  second  subspace  is  informative  in  itself  but  is  not 
informative  if  it  is  added  to  the  first  one.  This  example  explains  approximately  in  which  sense  we  use  the 
term  "subspaces  correlation".  The  latter  may  be  specified  formally  but  it  is  not  necessary  because  it  is 
clear  how  to  take  into  account  correlation  of  subspaces  in  the  interactive  visualized  selection 
procedure  to  avoid  utilization  of  highly  "correlated"  subspaces  in  the  classification  procedure.  Let  us 
explain  how  it  can  be  performed. 

In  the  Fig.4.6,  a  sample  of  a  decision  tree  is  depicted.  In  the  first  step  the  developer  has  selected 
the  2-d  subspace  <Xj,Xj2>  as  the  most  informative.  The  upper  screen  (Fig.4.6)  depicts  projections  of 

both  clusters  "0"  and  "1"  onto  the  plane  <x^,Xi2>  and  a  linear  separation  rule  (continuous  line) 

established  by  an  expert  manually  as  the  optimal  one  (The  broken  line  corresponds  to  the  separation  rule 
calculated  automatically  but  rejected  by  Ae  developer.).  This  separation  rule  divides  the  entire  set  of 
training  experimental  data  into  two  non-overlapping  subsets  5^ ,  5^ . 

The  next  steps  of  subspace  selection  are  applied  separately  to  subsets  Sj  and  which  form  two 
new  nodes  of  decision  tree.  Therefore,  on  the  second  and  third  steps  we  have  to  solve  two  tasks  of 
informative  subspaces  selection  for  the  two  above  mentioned  subsets  Sj  and  S2  of  experimental  data.  In 
the  Fig.4.6  the  results  of  the  second  step  selections  are  visualized  and  represented  as  printouts.  The  next 
and  all  further  steps  can  be  realized  in  the  same  way.  As  a  result,  a  decision  tree  depicted  in  the  Fig.4.6  is 
obtained.  This  decision  tree  is  provided  by  all  related  information,  i.e.  set  of  2-d  subspaces,  equations  of 
separation  rules,  classification  predicates,  related  probabilities,  etc.,  obtained  by  the  developed  software 
automatically  (This  information  is  omitted  in  the  fig.4.6).  One  can  see  that  some  separation  bounds  are 
chosen  in  a  nonlinear  form. 

4.5.  Probabilistic  decision  making  procedure 

The  result  of  the  above  procedure  is  tiie  decision  tree  such  that  each  its  leaf  is  mapped  to  a  subset  of 
cases  of  original  training  data.  These  subsets  are  not  overlapping  and  their  union  covers  the  entire  set  of 
training  data.  On  the  other  hand,  each  leaf  is  mapped  to  its  own  classification  predicate  constituted  as 
conjunction  of  classification  predicates  of  decision  tree  nodes  met  along  the  way  from  tree  root  up  to 
respective  leaf  Each  predicate  is  true  in  the  concrete  region  of  the  factor  space,  and  the  regions 
corresponding  to  the  different  leaves  are  not  overlapping  and  cover  the  entire  factor  space.  Hence,  they 
can  be  used  as  the  elementary  events  to  design  a  probabilistic  space.  Let  us  consider  in  more  formal  way 
how  the  probabilistic  space  is  constituted  and  how  it  is  utilized  to  assess  the  probability  of  failure  of  a 
device  having  given  "history  of  abuse". 
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Fig.  4..6.  Decision  tree  for  the  node  of  meta-tree  <Status  of performance  "-1  'V  "O’V  “I  ”> 

(see  fig.  4.1) 
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Let  {Rj,R2,...,Rs}  is  the  set  of  elementary  events  which  are  mapped  to  the  set  of  leaves  of  a 
designed  decision  tree  and  {Pj,  Pg}  is  the  set  of  the  respective  (mapped  to  corresponding  leaves  of 
decision  tree)  classification  predicates.  Each  such  elementary  event  Rie{R],R2,...iRs}  can  be  mapped  to 
an  empirically  estimated  probability  Pj^  (X)=^p(X)  on  the  basis  of  testing  of  designed  decision  tree  over 

both  training  and  testing  data  in  a  traditional  way.  We  suppose  that  these  estimations  are  calculated  as 
confidence  intervals  for  given  level  of  confidence  probability.  In  the  same  way,  confidence  intervals  of 

probabilities  p,  (X/0)  and  can  be  estimated.  Note  that  As  a  result 

for  each  predicate  Pj,  P2,...,  Pg  the  following  probabilities  will  be  estimated  by  confidence  intervals: 


Phil II)  =  PlPhW -'trueVX  ^T],  p,illO)  =  p[P,(X)  ="true"/X  ^’0"],  (4.7) 

pAO/1)  =  p[P,W  ="  false"! X  ^7"],  p,{P i 0)  =  ="  false"! X  ^'0"]. 

The  availability  of  the  probabilities  (4.7)  makes  it  possible  to  calculate  the  target  probability  of 
failure  for  any  point  of  the  factor  space  X*  subject  to  the  condition  that  P^(X  )="true"  using 


Bayes'  formula  as  follows: 


p[\!P,{X*)]  = 


p{l)PHilll) 

p(,l)p,(.l!l)  +  p{0)p,il!0y 


(4.8) 


Here  p(l),  p(0)  -  are  prior  probabilities  of  “border-line”  and  “failure”  of  the  device  operation.  It  is 
obvious  that  probability  of  normal  operation  under  the  adverse  exposures  X  is 

pAO!P,ir)\  =  l-lil!PHiX*)\ 

In  the  general  case  when  the  available  size  of  training  and  testing  data  is  too  small  we  are  not  able 
to  obtain  the  satisfactoiy  accuracy  of  estimations  of  task  related  probabilities  to  forecast  a  probability  of 
failure.  However,  the  quality  of  designed  prognostic  model  depends  critically  on  the  above  accuracy. 
Therefore,  we  have  to  undertake  special  efforts  to  provide  the  needed  accuracy.  In  the  next  section  we 
investigate  an  approach  to  cope  with  the  above  problem. 

4.6  Improvement  of  Assessment  of  the  Task-related  Probabilities 

Recall  that  the  main  goal  of  the  prognostic  model  under  development  is  the  reevaluation  of  the 
probability  of  failure  of  an  avionics  module  on  the  basis  of  its  actual  ’’history  of  abuse”  represented  by 
the  vector  of  adverse  exposures.  It  is  elsewhere  adopted  that  the  accuracy  of  the  assessment  of  this 
probability  depends  on  two  factors: 

(1)  the  total  amount  of  experimental  data  that  is  used  for  training  (prognostic  model  design)  and 
testing  (evaluation  of  the  model  quality), 

(2)  quality  of  prognostic  model  and  model-based  decision-making  procedure. 

We  assume  that  in  our  case  the  amount  of  experimental  data  is  small^  and  we  have  no 
information  about  the  distribution  of  realizations  within  data  clusters.  Therefore,  in  a  general  case  even 
the  best  prognostic  model  may  such  that  it  is  not  able  to  provide  precise  probability  assessment. 
However,  we  are  able  to  assign  a  confidence  interval  for  the  probability  of  each  elementary  event 
associated  with  each  leaf  of  the  designed  decision  tree  that  constitute  a  guaranteed  estimation. 

To  narrow  these  confidence  intervals  and,  hence,  to  improve  the  accuracy  of  probability 
assessment,  we  proposed  the  following  two  approaches: 

•  Use  of  a  set  of  prognostic  models  (a  set  of  decision  trees)  that  differ  in  factor  subspaces  utilized  by 
each  decision  tree  of  the  respective  prognostic  model.  This  approach  leads  to  a  collective  of  decision¬ 
making  procedures.  This  way  is  based  on  the  redundancy  of  information  involved  in  a  decision 
making  and  results  in  the  improvement  of  the  probability  accuracy  estimation.  To  realize  such 
approach  it  is  necessary  to  develop  a  special  algorithm  for  joint  processing  of  probabilities  resulting 


^  Otherwise  the  problem  of  improving  of  the  probabilities  assessment  doesn’t  exists. 
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from  each  decision  tree.  We  proposed  to  use  the  algorithm  based  on  the  Algebraic  Bayes'  Network 
(ABN)  approach  developed  in  [Gorodetsld-92],  [Gorodetski  et  al-97]. 

•  It  is  proposed  to  replace  the  traditional  Bayes’  formula  (13)  resulting  in  a  posterior  probability  by  its 
equivalent  developed  on  the  basis  of  methods  of  interval  mathematics  which  facilitates  the  calculation 
of  a  guaranteed  estimate  of  the  probability  of  failure. 

Consider  the  brief  discussion  of  the  algorithms  implementing  the  above  approaches.  In  [IR-98] 
these  algorithms  were  illustrated  by  a  numerical  example  of  the  assessment  of  failure  probability  of  an 
avionics  module.  It  is  understood  that  the  quantities  generated  by  a  decision-making  procedure 
represented  by  a  decision  tree  depend  on  the  choice  of  the  root  and  intermediate  nodes. 
Typically,  we  are  able  to  design  a  number  of  decision  trees  utilizing  different  subspaces  of  the  factor 
space  that  results  in  information  redundancy.  Consider  the  impact  of  this  redundancy  on  the  accuracy  of 
the  prognostic  procedure. 

Assume  that  a  set  of  three  decision  trees  has  been  established  within  a  particuW  prognostic  inodel. 
Consider  the  application  of  this  model  for  the  assessment  of  the  probability  of  failure  of  an  avionics 

module  subjected  to  adverse  exposures  X.  Assume  that  vector  X  results  in  elementary  events  i?,  ,  Rj 


and  R I  or,  using  some  specific  jargon,  belongs  to  the  appropriate  leaves  of  the  first,  second  and  third 
decision  trees.  The  respective  probabilities  are  defined  as 

Rj;  a'  <.p(X)  Sft,',  a](0)ip(X/0)  a](\)ip(X/I)  (4.9) 

R]:  af  £p(X)  <.b].  a](<S)<p(X/0)  <b](0).  a)i\)<.p(X/I)  <.b](l);  (4.10) 

Ri:  al  <.p(X)  <.bl.  4(0)<p(J70)  <*.’(0),  al(\)<.p(X/l)  ^**(1)-  (411) 

It  could  be  seen  that  vector  Z  belongs  to  all  tree  subspaces,  therefore  the  probability  of  the  event 
R’ A  Rj  A  Rl,  i.e.  the  probability  of  the  event  p(Rl  a  Rj  a  R^)  could  be  defined  on  the  basis  of 

Algebraic  Bayes'  Network  (ABN)  approach  developed  in  [Gorodetski  et  al-97]  (see  also  Section  5).  This 
approach  reflects  the  basics  of  the  probability  theory  and  requires  that  the  interval  constraints  (4.9)  - 
(4.1 1)  be  supplemented  by  some  fimdamental  axioms. 

In  the  case  under  consideration,  we  specify  the  interrelationships  between  probabilities  that  are 

defined  by  each  decision  tree  (they  are  given  in  (4.9)  -  (4.11))  and  the  probability  pfRj  a  Rj  aR^). 


Following  the  ABN  approach  we  represent  the  interrelationships  between  probabilities  by  a  Basse 
diagram  [Birkhoff-67]  as  shown  in  fig.4.7.  Denote  the  probabilities  of  events  constituted  by 

intersections  of  events  Rf ,  Rj  and  R^  as  follows: 
p(XeRl)=p(X^ ),  p(XeR'j)=p(X^),  p(Xe  Rl)=p(X^), 
p[(XeRl)&(X€R])]=p(X,X^),p[(XeRl)&(XeRl)]=p(X,X,), 
p[(Xe  R]  )&(Xe  R^  )]=p(Xj^  X^),  p[(XeRl)&(X  e  R]  )&(XeRl  )J=p(X,  X^X,). 

The  following  are  the  interrelationships  between  probabilities  reflecting  the  axioms  of  norm  and 
additivity: 


p(X,)*p(X,)-  p{X,X,)^I 

piX,)*p(X,)-p(.X,X,)  <1 
p(,X,)*p(X,)-p(.X,X,)iU 


p(X,)-p(.X,X,)iiO 

p{X,)-p(X,X,)i:0 

P(X,)-P(X,X,}>0 

p(X,)-p(X,X,)>0 

p(X,)-p(X,X,)l:0 

p(X,)-p(.X,X:)iO 


*p{X2)-^p(^X^)-p(^XiX2)-pi^XiXi)- 

-p(X2X2)*p(X,X2X,)<.I. 


pl,X2X,)-p(X,X2X,)sO, 

p(X,X,)-p(X,X,X,)iO. 

p(jr,z,)-p(x,X;jr,)kO, 

p(X2)-p(.X,X,)-p{X2X,)*p(X,X2X,)^0. 
p(X2)-p(X,X2) -piX2X,)+p(X,X2X,)^0. 
p(X2)-p(XjX2)-piX,X2)-^p(.X,X2X,)iO. 
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and,  as  usual, ^  0,  i  =  1^,3,  p(XjXj)^0,  i,j  — 1,2,3,  p(XjX2Xj)'^0 . 

These  interrelationships  are  viewed  as  the  background  knowledge,  that  could  be  incorporated  in 
the  estimation  of  probability  p(X)=  p^XjX^X^)  in  the  form  of  two  linear  programming  problems  as 

follows: 

1.  minfpiX  1X2X3))  under  constraints  (4.9),  (4.10),  (4.11)  and  (4.12)  (lower  bound);  (4.13) 

2.  max{p(X 2X2X3))  under  constraints  (4.9),  (4.10),  (4.11)  and  (4.12);  (upper  bound).  (4.14) 

Note  that  based  on  experimental  data  it  is  possible  to  assess  intervals  for  probabilities  p{XjX2) , 
p(X]X3)  and  p{X2X3)  as  well.  This  can  be  useful  for  further  narrowing  intervals  of  probabilities  in 
question.  This  approach  to  improving  the  accuracy  of  interval  probabilities  is  very  fruitful.  It  was 
demonstrated  numerically  by  example  in  [IR-98]. 

Now  consider  the  use  of  Bayes'  formula  (4.8)  for  calculation  of  the  posterior  probability 
in  the  case  when  probabilities  p(X/l)  and  p(X/0)  are  given  by  their  confidence  intervals.  Our  goal  is 

to  calculate  the  upper  bound  of  probability  p(l/X). 
Therefore,  the  task  is  to  find  its  maximum  value  subject  to 
constraints 

aj<P(X/l)<b,,  ao<P(XIO)<bo.  (4.15) 

Simple  analysis  of  this  optimization  task  shows  that  it 
is  equivalent  to  the  task  of  maximization  of  the  quotient 
p(X/l)/p(X/0).  This  provides  the  justification  for  the 
following  formula: 

max{p(l  I  X)  =  —  (4- 1 6) 

p(l)bj  +  p(0)ao 

The  detailed  numerical  demonstration  of  the 
developed  technology  for  design  information-based  health 
assessment  system  was  given  in  [IR-98].  Additional 
numerical  results  obtained  for  statistical  database  generated 
by  the  developed  DDM  is  given  in  the  next  subsection.  Statistical  database  itself  contained  learning 
and  testing  Data  is  given  in  Appendix  A2. 

4.7.  Numerical  results 

In  [IR-98]  we  demonstrated  the  developed  technology  of  probability  of  failure  assessment  and 
prognosis  in  detail.  In  this  report  we  omitted  Ais.  Instead,  let  us  consider  additional  numerical  results 
regarding  the  statistical  assessment  of  the  quality  of  decision  making  procedure  over  the  testing 
database.  The  latter  was  generated  on  the  basis  of  DDM  model  developed  in  this  research  and 
described  in  Section  1.  This  database  consists  of  200  cases  including  125  cases  of  the  cluster  “no 
failure  ",  50  cases  of  the  cluster  “border-line  "  and  25  cases  of  the  cluster  “failure  ”  (see  Appendix 
A2).  Note  that  these  cases  were  not  involved  in  the  learning  procedures  resulting  in  decision  trees 
depicted  in  the  fig.4.6,  fig.4.8,  fig.4.9  and  fig.4. 10. 

Numerical  results  of  testing  are  presented  in  tab.4.1  below.  One  can  see  that  each  decision  tree 
provides  the  sufficient  level  of  classification  quality.  Note  that  these  results  were  obtained  on  the  basis 
of  small  training  database.  In  fact,  if  we  were  continuing  learning  procedure  involving  in  this  process 
new  training  data  we  should  be  able  to  reach  much  more  high  level  of  perfect  classification. 

Let  us  comment  data  in  two  last  columns  of  the  table.  The  column  #7  contains  probabilities  of 
classification  (the  latter  are  presented  in  the  column  #2)  for  the  case  of  using  decision  procedure  on  the 
basis  of  decision  trees  voting  according  to  the  rule  “two  of  three”.  In  the  colunm  #8  we  presented 
probabilities  of  classification  for  the  case  if  voting  is  organized  corresponding  to  the  rule  “if  at  least 
one  decision  tree  votes  for  “failure”  then  decision  is  “failure”.  Note  that  in  this  Report  we  don’t 
demonstrate  numerically  use  of  ABN  because  the  latter  was  presented  in  Interim  report  [IR-98]. 


/K 

p(.X,xy  jiX,X,)  \p(X,X,) 


p(.x,)  Pix,-)  p(X,) 

Fig.4.7.  Algebraic  Bayes'  Network 
for  three-  propositional  case 
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^order-line 


‘dilure*’ 


Fig.  4.8.  Decision  tree  #1  for  the  node  of  meta-tree  <Status  of performance  “0”v  “!"> 

(see  fig.  4.1) 
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Fig.  4.9.  Decision  tree  #2  for  the  node  of  meta-tree  <Status  of  performance  "0”v  “i"> 

(see  fig.  4.1) 
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Fig.  4.10.  Decision  tree  #3  for  the  node  of  meta-tree  <Status  of  performance  “0”v  ‘7  ”> 

(see  fig.  4.1) 
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Let  us  stress  again  that  numerical  results  given  in  tab.4.1  were  obtained  over  testing  data  that 
were  not  involved  in  die  learning  procedure  to  design  decision  making  rules. 


Table  4. 1 .  Numerical  results  regarding  to  the  statistical  assessment  of  the 
quality  of  decision  making  procedure  over  the  testing  database. 


Iv 

Node  of 
meta-tree 

Si 

Decision 

treefig. 

4.9 

.  ■■ . ■  ■■  . 

Decision 

tree 

fig.4.10 

■ 

Voting 
“al 
least 
one  for 
failure  ’’ 

Status  of 
performance 
“-1  ”v“0’’v 

t€  J  » 

0.96 

wAmm 

0.04 

0.05 

p(0vl/0vl) 

0.95 

■■■ 

■■■ 

■■■ 

Status  of 
performance 
“0"v“l” 

IHHB 

0.935 

0.91 

0.918 

0.936 

0.952 

msmm 

0.065 

0.09 

0.082 

0.064 

0.048 

fmamt 

0.25 

0.3 

0.19 

0.214 

0.303 

msn^m 

0.75 

0.7 

0.81 

0.786 

0.697 

While  designing  decision  trees  we  restricted  ourselves  only  by  two-level  trees.  It  is  obvious  that 
adding  one  more  level  could  lead  to  a  higher  quality  of  classification  up  to  totally  perfect 
classification.  But  we  aimed  at  demonstration  of  benefit  of  use  of  collective  decision  making. 

Nevertheless,  one  can  see  that  resulting  classification  procedure  possess  high  probabilities  of 
perfect  classification.  Note  that  there  is  no  cases  of  the  cluster  “failure"  that  was  classified  as 
belonging  to  the  cluster  “no  failure”.  This  result  may  by  considered  as  an  advantage  of  three-cluster 
interpretation  of  the  cases  of  database  proposed  in  this  research.  Note  as  well  that  utilization  of 
collective  of  decision  trees  makes  it  possible  to  reach  more  high  level  of  perfect  classification.  In 
Appendix  A2  results  of  testing  of  each  decision  tree  regarding  to  each  case  of  testing  data  and  results  of 
use  voting  procedures  according  to  two  schemes  are  given. 
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5.1.  Introduction 

The  theoretical  foundation  of  Algebraic  Bayes'  Ne^orks  (AB^  ^ 

Knowledge  Engineering  as  a  formal  model  for  experts'  knowledge  formal  spemfication  were 
Serrf  toSifs  in  L  Interim  Report  [IR-98].  It  was  justified  that  model  makes  ixrssible  to 

solve  a  ^  of  formal  specification  of  knowledge  nnder  uncei^  nHS^rSis 

We  come  back  to  thU  problem  in  ibis  Report.  In  addition  to  the  matenal  given  “  ^ 

Report  we  present  numerical  demonstration  of  the  approach  and  on  the  basis  of  the  developed  case 

““‘‘’'  niis  model  is  based  on  probabilistic  approach.  In  traditional  prob^iMe  ^ 

uncertainty  of  expert's  statement  is  described  by  a  real  number  i.e.  f 

exoert’s  statements.  It  is  well  known  and  it  was  shown  by  examples  m  [IR-98]  that  m  i^y  practical 
cJes  expert  is  not  able  at  all  to  estimate  precisely  probabilities  associated  with  ^tements  about 
denendencies  in  a  domain  At  the  best  case  expert  is  able  to  determine  the  lower  and  the  upper  bounds 
S^ve  probSles.  i.e.  to  determine  ^called  interval  probability.  Ibis  ease  doesn't  matdi 

‘^''ws'^^e^ta^'^obSiilities^don't  fix  any  probabilistic  distribution^  If  multitude 
with  assigned  interval  probabilities  is  given  then  even  for  the  case  when  these  e^ta 
mlbTSly  independU  interval  probabilities  of  events  don’t  determine  a  probability  distribution 
bta  an  indefinite  class  of  distributions.  This  peculiarity  reflecj  the  uncertam  nature  of 

expert’s  knowledge  about  a  domain.  To  cope  with  interval-valued  probabilities,  it  was  proposed  a 
nXber  of  approaches.  One  of  them  is  so-called  Algebraic  Bayes’  Network  formd  model  proposed  by 
author  of  thiC  Report  and  considered  in  detail  in  [IR-98].  One  more  source  of  problems  associated  with 
unSrtain  expert  Liowledge  processing  and  representation  is  its  inconsistency  that  m  many  cases  takes 

In  this  Report  we  omitted  the  main  part  of  conceptual  explanation  of  ABN  idea  and  don’t 
concern  to  interrdations  between  ABN  model  and  other  formalisms,  ^ese  aspects  were  Presented  in 
nR-981  in  details  Instead,  below  we  describe  the  ABN  formal  model,  its  components  and  related 
Sorithms  and  focused  on  its  application-oriented  details.  Of  course,  a  part  of  this  section  comcides 
with  material  given  in  [IR-98].  The  main  part  of  material  is  repeated  to  m^e  this  section  m  some 
sense  self-contained  and  comprehensible  in  respect  to  the  application-onented  matenal  and  numencal 

demonstrations. 

5.2.  Properties  of  expert  information 

At  first  phase  of  life  circle  of  any  technical  device,  as  a  rule  there  is  no  statistical  data  to  develop 
its  diagnostic  model.  The  most  frequent  case  is  that  only  experts'  information  may  be  available  if  there 

are  experts  experienced  with  prototypes  or  analogues.  •  ♦  +  *  +i,»  -.xrtpnt  tn  tip 

Unfortunately  human  knowledge  is  often  imprecise  and  mconsistent  to  the  extent  to  be 

possible  to  specify  this  knowledge  in  precise  formal  terms.  The  latter  leads  to  a  number  of  di^cult 
OToblems  in  its  practical  use  within  knowledge-based  decision  support  “ 

diagnostic  systems.  Human  reasoning  is  very  difficult  for  formd  modeling  as  well.  ^Ithou^  e^rts 
knXledge  mid  human  reasoning  formal  specification  is  a  subjects  of  mte^ive  and  deep  research 
during  at  least  last  two  decades  it  remains  to  be  a  hot  problem  in  Knowledge  Engineering  up  to  “ow. 

It  was  noticed  in  [IR-98]  that  the  causes  of  uncertainty  and  inconsistency  of  expert  information 

may  be  as  follows;  . .  -  , 

•  Qualitative  form  of  expert  statements  about  dependencies  over  a  set  of  entities  of  a  domai 

that  makes  its  unique  formalization  impossible;  . 

•  Very  close  semantics  (sense)  of  the  most  part  of  expert’s  statements  that  i^  be  pressed  m 
diverse  terms.  As  a  rule,  different  experts  express  the  same  dependency  m  different  words, 

•  Diversity  of  experts'  experience  what  can  be  a  cause  of  hard  contradictions. 
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•  Incompleteness  of  experts'  knowledge;  in  addition,  experts  are  often  unable  to  verbalize  their 
knowledge. 

It  is  generally  adopted  that  to  formalize  expert's  knowledge,  an  approximate  model  is  to  be  used. 
There  is  no  sense  to  use  any  precise  formal  model  to  formalize  imprecise  information.  It  was  shown  in 
[IR-98]  that  expert  is  not  able  to  estimate  probabilities  of  any  statements  or  combination  of  statements 
precisely,  i.e.  in  terms  of  point-wise  probability  estimation.  In  addition,  it  can  be  demonstrated  by 
numerical  examples  that  probabilistic  models  designed  on  the  basis  of  an  expert  information  specified 
in  terms  of  point-wise  probabilities  should  be  inconsistent  inevitably.  Therefore,  such  models  cannot 
be  used  for  correct  decision  making. 

5.3.  Advantages  of  probabilistic  model  of  expert  information  vs.  fuzzy  one 

There  exist  a  number  of  approaches  for  dealing  with  uncertainty  of  knowledge.  Among  them 
several  pseudo-physical  logic,  many-valued  logic,  fuzzy  logic  and  theory  of  possibilities  ([Zadeh-78], 
[Dubois  et  al  -88])  are  the  most  popular  ones.  But  probabilistic  model  takes  a  noticeable  role  because 
it  has  a  number  of  important  advantages  comparing  to  other  mentioned  ones.  They  are  as  follows: 

•  probabilistic  model  is  based  on  a  well  developed  mathematical  theory,  i.e.  probabilistic  theory; 

•  probabilistic  models  are  computationally  feasible:  due  to  Large  Number  theorems,  we  can  deduce 
and  check  probabilistic  formulae  by  using  repeated  statistical  simulation  of  representative  size; 
model  output  obtained  theoretically  within  probabilistic  approach  may  be  validated  by 
simulation; 

•  between  all  known  formalisms,  probabilistic  methods  provide  the  most  well-developed 
description  of  dependency  which  in  probabilistic  approach  is  described  by  the  notion  of 
conditional  probability. 

Unfortunately,  classical  probabilistic  models  based  on  traditional  axiomatic  approach  are  not 
well  fitted  for  modeling  of  uncertainty  of  expert  information  and  human  reasoning.  The  main  reason  is 
that  it  is  not  clear  how  probabilities  could  be  obtained.  As  a  rule,  empirically  assessed  probabilities  are 
imprecise  and  such  probabilities  can  be  assigned  by  confidence  interval.  A  large  amount  of  expert 
information  cannot  be  expressed  in  any  verbal  form.  The  last  deficiency  is  common  for  all  formal 
models  of  uncertainty. 

In  [IR-98]  we  have  analyzed  the  existing  probability-based  approaches  developed  to  deal  with 
interval  probabilities  ([Dempster-66],  [Shafer-76],  [Fagin  et  al-88],  [Fagin  et  al-89]).  It  was  shown  that 
ABN  formal  model  may  be  considered  as  a  special  case  of  the  approach  developed  in  [Fagin  et  al-88] 
and  [Fagin  et  al-89].  In  this  approach  a  probabilistic  space  is  introduced  in  axiomatic  way  via 
multitude  of  so-called  basic  random  events  that  may  be  dependent  in  probabilistic  sense  and  may  be 
incomplete.  The  latter  means  that  in  such  probabilistic  space  there  exist  random  events  that  do  not 
belong  to  the  algebra  of  event  that  is  generated  by  the  multitude  of  basic  events  and  operations  of 
union  (u),  intersection  (n)  and  complement  (/). 

However,  Fagin’ s  at  al  model  which  uses  interval  probabilistic  measure  of  knowledge 
uncertainty  is  very  difficult  for  implementation  in  practice,  requires  a  technique  to  deal  with  expert’s 
information  as  it  is  and  doesn’t  propose  a  standard  way  of  expert’s  knowledge  representation.  ABN 
model  is  an  attempt  to  overcome  these  problems  and  oriented  on  practical  cases  of  information 
extracted  from  experts. 

5.4.  Concept  of  knowledge  piece  and  background  probabilistic  knowledge 

In  this  subsection  the  basic  concepts  and  notions  to  build  a  model  of  so-called  Algebraic  Bayes' 
Network  (ABN)  are  introduced.  This  model,  in  turn,  is  utilized  for  expert  information  formal 
specification  and  consistent  processing. 

5.4.1.  Denotations 

Let  0f)  =  {Xj,X2,...,x„},  Xj  false, true}  be  a  multitude  of  propositions  and 
F(x^^,x^^,...,x^^)  be  a  formula  from  the  set  of  well  formed  formulae  given  over  where  X= 
{x^^,Xi^,...,Xi^}-  is  a  subset  of  propositions .  Sometimes  we  shall  consider  a  subset 
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X={X(^,x,^  ,...,Xi^}  as  a  tuple  (vector)  or  as  sequence  of  symbols  without  changing  a  denotation  if  its 
sense  will  be  clear  from  the  context 

Propositions  {Xi^,x,^,...,x,^}  can  be  used  with  negation  (for  example,  -i  xj  or  without  it 
(x,  ).  To  avoid  confusion  for  the  case  when  we  use  symbol  x^  in  the  sense  of  argument  name  (it  can 
be  substituted  by  literal  with  negation  or  without  it)  we  denote  the  last  case  as  x^  putting  symbol 

above  corresponded  name.  This  way  is  used  to  denote  a  set  of  arguments,  therefore  X  is  denotation  of 
the  set  of  all  components  of  the  vectorX  For  disjunction  and  conjunction  we  use  symbols  'V'  and  "a" 
respectively.  In  most  cases  the  symbol  "a"  is  omitted. 

Let  X  =  {Xj^,Xj^,...,Xi^}  be  a  set  of  propositions.  Two  subsets  Xj  and  X2  such  that 

Xj  uX2=X  ,  and  XjnX2  =0  are  called  partitions  of  a  set  X  =  {x^^,x^^,...,x^J .  k  symbol  of 
proposition  "x^ "  can  be  interpreted  as  denotation  of  a  random  event  as  well.  Similar,  a  conjunction  of 
propositions  may  be  considered  as  complex  random  event  that  is  realized  if  all  its  arguments 
(propositions  wiA  or  without  negation)  are  assigned  by  truth  value  "true”.  In  such  an  interpretation  of 
a  conjunction  of  propositions  we  may  call  it  as  random  binary  sequence  and  assign  its  components  a 
value  "1"  or  "0"  depending  on  whether  the  corresponded  random  event  is  realized  or  not. 

It  should  be  noted  that  interpretation  of  a  conjunction  of  propositions  either  as  formulae  or  as 
random  event  is  based  on  the  isomorphism  of  algebra  of  logic  formulae  and  algebra  of  random  events 
that  was  discussed  elsewhere  in  the  literature. 

Below  in  the  next  subsection  we  consider  basic  components  of  Algebraic  Bayes’  Network,  i.e. 
so-called  “knowledge  pieces”  of  different  ranks.  In  the  [IR-98]  these  notions  were  discussed  in  details 
and  were  explained  conceptually.  In  this  Report  we  describe  them  and  ANB  itself  in  more  formal  way. 
Instead,  the  practical  application  is  discussed  in  more  depth. 

5.4.2.  Knowledge  Piece  of  rank  2  (two-propositions  knowledge  piece) 

Let  us  consider  a  two-element  multitude  of  propositions  Xfj^ii,j)  =  {x^,X2}.  Proposition 
without  negation  we  call  as  positive  one.  Let  us  introduce  in  a  standard  way  an  order  relation  over  the 
family  set  over  the  set  (i,j)  =  {Xi,Xj},  i.e.  N  =  {{x,.},{x^},{x,,x^}}.  Each  subset  of  this 
family  set  can  be  mapped  to  the  conjunction  constituted  by  positive  propositions  contained  within  it. 
These  propositions  may  be  ordered  in  the  same  way  as  corresponded  subsets  of  the  family  set  N . 
Finally,  let  each  conjunction  be  assigned  a  probability.* 

Definition  5.1.  Knowledge  piece  of  rank  2  is  a  partially  ordered  set  of  positive  conjunctions  that 
corresponds  to  elements  of  family  set  of  the  set  X(2)(i,j )  each  assigned  a  truth  probability,  i.e. 


K^^\xi,Xj)  =  {<  {x,},;7(x;)  >,<  {Xj},p(Xj)  >,<  {x,,x^.},p(x;x^.)  >}.  (5.1) 

Propositions  {Xi,Xj}  constituting  knowledge  piece  K^^fx^,Xj)  are  called  its  arguments.  ■ 


p(XjX2) 


Graphical  representation  of  a  knowledge  piece  K^^fx^.Xj)  of  rank 
2  in  the  form  of  Hasse  diagram  [BirkhofF-67]  is  depicted  in  the  fig.5.1. 
Definition  5.2.  Knowledge  piece  K^^^(x^,Xj)  is  called  consistent  if  the 


p(Xj)  p(X2) 

p(Xi)<l, 

(5.2) 

Fig.5.1.  Hasse 

P(X2)<1, 

(5.3) 

diagram  for 

P(Xi)-p(XjX2)>0, 

(5.4) 

knowledge  piece 

X(2)  ~ 

P(X2)-P(X,X2)>0, 

(5.5) 

*  Let  us  emphasise  that  the  probabilities  have  to  meet  probabilistic  axioms. 
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l.p(Xj)-p(x2)^p(XtX2)>0M  (5.6) 

Inequalities  (5.2)-(5.3)  correspond  to  the  probabilistic  axioms  and  represent  background 
knowledge  of  probability  theory  that  has  to  be  met  in  any  case.  If  these  inequalities  are  held  for 
probabilities  p(Xj),  p(x2),  p(XjX2)  then  there  exist  an  assignment  of  probabilities  of  all  other 
formulae  defined  over  propositions  x,  ,  Xj  which  probabilities  meet  probabilistic  axioms. 

5.4.3.  Knowledge  piece  of  rank  3  (three-propositions  knowledge  piece) 

The  notion  of  knowledge  piece  of  the  rank  3  is  defined  like  one  of  the  rank  2. 

Definition  5.3.  Knowledge  piece  of  rank  3  is  a  partially  ordered  set  of  positive  conjunctions  that 
corresponds  to  elements  of  the  family  set  of  the  set  ^  (i,j,k)  =  {Xj,Xj,Xf^}  each  assigned  a 

truth  probability,  i.e. 


K^^\x^,xj,x^)  =  {<{xf,p(x,)>,<{xj},p(xj)>,<{x^},p(x^)>,<{x„xj}, 

p(XiXj)>,<{Xi,X^},p(XiX^)>,<{Xj,Xk},p(.XjXk>,<{Xi,Xj,X^),p(x,XjX^}>}. 


Propositions  {Xj,Xj,Xi^}  constituting  knowledge  piece  K^^fx^,Xj,x^)  are  called  its 
arguments.  ■ 

Graphical  representation  of  a  knowledge  piece 
p(x  1X2X3)  K^^fXi,Xj,x^)  of  rank  3  is  depicted  in  the  fig.5.2. 

Definition  5.4.  Knowledge  piece  (x^.Xj.Xi^)  is  called 

Xj)  consistent  if  all  knowledge  pieces  of  rank  2  contained  in  it 
are  consistent  according  to  the  Definition  5.2  (i.e.  knowledge 

pieces  K^^fx^.Xj),  K^^\xi,x^),  K^^'‘{Xj,xf)  are 
consistent)  and  in  addition  the  following  constraints  are  met: 

PiXjX^  )  -  p(x,XjX^  )>0,  (5.8) 

PiXiX^  )  -  pix.XjX^  )>0,  (5.9) 

p{XiXj)-p(x,X2X3)>0,  (5.10) 

Pixk)-pix,xf)-p{xjxf^+p{x,xjxf^>0,  (5.11) 

PiXj)-p{x,Xj)-p{XjXf)+p{XtXjXfi>0,  (5.12) 

PiXi)- pix^XJ)- p{XiXf)  +  piXiXjXf^>0 ,  (5.13) 

l-piXi)-p{XJ)-p{xf)+p{x,XJ)  +  p{x^X^)  +  p{XJXf)-p{x^XJXfi  >0m  (5.14) 


Fig.  .5.2.  Basse  diagram 
for  knowledge  piece 

X(3j  =  (Xj  ,X2,X3} 


Consistency  conditions  formulated  in  the  Definition  5.4  for  knowledge  piece  of  rank  3  have  the 
same  sense  how  it  was  conunented  for  consistency  conditions  of  knowledge  piece  of  rank  2.  Let  us 
repeat  them  in  brief.  If  knowledge  piece  of  rank  3  is  consistent  then  there  exists  a  consistent 
assignment  of  truth  probabilities  of  all  other  formulae  that  may  be  defined  over  the  same  propositions^. 
Inequalities  (5.9)-(5.14)  represent  background  knowledge  of  probability  theory  regarding  to  the 
probabilities  associated  with  knowledge  piece  of  rank  3 . 

We  can  define  the  notion  of  knowledge  piece  of  any  arbitrary  rank  but  no  more  than  the  number 
of  atomic  propositions  in  the  set  0q  (see  [IR-98]). 

In  Appendix  A3  the  full  form  of  consistency  conditions  are  given  for  the  knowledge  pieces  up  to 
the  rank  4. 


^  It  is  shown  below  that  they  can  be  calculated  uniquely. 
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Fortunately,  in  practice  of  knowledge  engineering  we  may  restrict  ourselves  by  considering 
knowledge  pieces  of  rank  of  not  more  than  4  (see  [IR-98]).  On  the  one  hand,  it  was  discovered 
experimentally  that  expert  is  not  able  to  assess  dependency  over  more  than  three  atomic  statements. 
On  the  other  hand,  if  a  rank  of  knowledge  piece  is  increased  then  reliability  of  the  involved  probability 
assessment  decreases  significantly.  Therefore  we  may  restrict  ourselves  by  considering  of  knowledge 
pieces  of  the  rank  no  more  than  3.  Below  it  is  shown  that  for  consistency  maintenance  we  need  to  deal 
with  knowledge  pieces  of  rank  4  ([IR-98)]. 

5.5.  Algebraic  Bayes’  Network;  Formal  Definition 

The  above  materials  form  a  foundation  for  introduction  a  notion  of  the  Algebraic  Bayes’ 
Network  that  is  basic  one  for  design  of  algorithms  of  expert  information  processing.  In  this  subsection 
the  corresponding  formal  framework  is  described. 

It  was  mentioned  above  that  experts  are  able  to  talk  reliably  about  dependencies  over  no  more 
then  three  atomic  statements,  i.e.  about  truth  probabilities  of  formulae  determined  over  sets  of 
propositions  =  (xj ,  X(2)(i>j)  =  {x^Xj}  and  Xpj(i,j,k)  =  {x^,Xj,x^}.  This 

experimentally  inferred  conclusion  makes  possible  to  restrict  ourselves  by  knowledge  pieces  of  rank  2 
and  3  as  standard  patterns  of  knowledge.  The  latter  can  serve  as  justification  for  assumption  that  after 
processing  of  fragments  of  expert  information  aimed  at  consistency  maintenance  the  result  may  be 
represented  as  a  set  of  consistent  knowledge  pieces  as  follows: 

KB  .  {K^^^(x,,Xj)}I,  j^J.  {Kf'^(x,.Xj,x,),  j.,^J  }.  (5.15) 

However,  different  instances  of  knowledge  pieces  in  (5.15)  may  contain  the  same  propositions 
and,  hence,  may  be  dependent.  It  means  that  they  have  to  be  structured  in  a  formal  framework  in  such 
a  way  that  it  should  be  possible  to  check  and  maintain  consistency  of  the  entire  multitude  of 
knowledge  pieces.  For  example,  the  same  conjunctions  can  be  contained  in  a  number  of  knowledge 
pieces.  Hence,  they  have  to  be  assigned  the  same  intervals  of  truth  probability  but  the  latter  may  be  not 
held  in  for  knowledge  pieces  contained  in  (5.15)  because  they  might  be  extracted  from  different 
experts  and/or  independently  on  each  other.  This  means  that  a  multitude  of  above  knowledge  pieces 
may  be  inconsistent  and,  hence,  doesn’t  represent  a  knowledge  base.  This  multitude  is  a  "half-finished 
product"  only  and  further  processing  is  needed  to  constitute  knowledge  base. 

As  a  structure  for  joint  representation  of  a  multitude  of  knowledge  pieces  that  makes  it  possible 
to  detect  and  eliminate  contradictions  of  knowledge  pieces  we  propose  to  use  a  so-called  "Algebraic 
Bayes'  Network"  (ABN)  structure  ([Gorodetski-92],  [Gorodetski  et  al-97]). 

Definition  5.5.  Let  us  call  an  Algebraic  Bayes'  Network  a  set  of  consistent  knowledge  pieces 

{K<‘>(x„Xj,x^),jt^Jj 

structured  as  Hasse  diagram  (semi-lattice)  ([Birkhof-85])  added  with 

•  a  multitude  of  constraints  conditioned  by  probabilistic  axioms  over  truth  probabilities  of 
all  conjunctions  that  are  contained  in  the  above  multitude  KB;  (they  form  so-called 
background  knowledge); 

•  algebra  of  formulae;  and 

•  inclusion-exclusion  formulae  for  truth  probabilities  of  formulae  of  the  multitude  KB.  ■ 

Let  us  explain  the  introduced  notion  by  example. 

Example  5.5.1. 

Let  ABN  consist  of  four  knowledge  pieces  formed  due  to  expert  information  about  possible 
dependencies  over  sbc  propositions  {Xj,X2,....Xg}  ([Gorodetski  et  al-97]): 

KB={K^^\xy,X2,X2),  K^^\x2,X2,Xg) ,  K^^\xg,Xg),  K^^\xg,Xg)}. 


^  Denotation  “KB  ”  for  this  set  emphasizes  the  fact  that  it  can  be  considered  as  knowledge  base. 
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Graphical  presentation  of  these  knowledge  pieces  is  given  in  the  fig.  5.3.  In  the  fig.5.4  graphical 
representation  of  the  corresponding  (structured)  ABN  is  given. 

It  can  be  shown  that  ABN  model  is  a  special  case  of  model  known  as  extended  probabilistic 
space  ([Fagin  et  al-88]).  Indeed,  ABN  is  an  extended  probabilistic  space  under  the  following 
assumptions: 

1.  A  multitude  of  basic  events  (algebra  of  family  set)  isomorphous  to  the  algebra  of  logic 
formulae  is  not  known.  Interval  assessments  of  truth  probabilities  are  known  for  a  subset  of 
logic  formulae  over  set  of  atomic  propositions  0g  =  {x,,X2,...,x„} ; 

2.  ABN  contains  only  positive  conjunctions  of  atomic  propositions  of  a  length  no  more  than  3. 

The  necessity  of  both  above  assumptions  is  conditioned  by  a  specific  of  application  that  has  to 

deal  with  expert  information  processing.  Indeed,  the  first  assumption  is  conditioned  by  a  limit  of 
information  that  can  be  extracted  from  experts,  and  the  second  one  reflects  the  restricted  possibility  of 
an  expert  to  discover  dependencies.  It  should  be  noted  that  a  model  of  extended  probabilistic  space 
([Fagin  et  al-88])  requires  too  much  information  to  be  practically  helpful  within  a  knowledge 
engineering  tasks. 


p(XjX2Xj) 


K^^^(Xj,X2,X2): 

p(Xj)  s  [0.5,10],  p(x2)e[0.6,0.8], 
pCxs)  e[0.9,1.0],  p(XjX2)  g[0.1,0.8], 
p(x,Xj)e [0.4,10],  p(x,x,)e[0.5,0.8], 

p(XjX2X3)  e  [0,0.8] . 

Fig.5.3.a 


pix^Xs) 


(x^,x^)\ 

p(xj  G  [0,1.0],  p(Xj)  G  [0,1.0], 

P(x^Xs)g[0,0.8]  . 

Fig.5.3.c 


p(X2X2X^) 


K^^^(x2,X2,X4): 

P(x2)g [0.5,07 ],  p(x3)g [0.8, 1.0] , 
p(xj  e[0.3,0.5],  p(x2X2)  g[0.5,0.7], 
P(X2X2)  G [0,0.2],  p(XjX^)  e[0.3,0.5], 
pix^x^x^)  G [0,0.2]. 

Fig.5.3.6 


P(XsXg) 


K<^^(Xs,xJ: 

P(x,)g  [0,1.0],  p(xj&  [0,1.0], 
P(XsX6)g  [0,0.5]. 
Fig.5.3.cr 


Fig.5.3.  A  multitude  of  knowledge  pieces  with  assigned  truth  probabilities  of  the  formulae 
corresponded  to  its  nodes  (components  of  ABN)  given  below  in  fig.5.4. 
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pix^x^x^)  pix^x^x^) 


Fig.5.4.  An  example  of  graphical  representation  of  ABN  contained  four 
Imowledge  pieces. 

5.6.  Consistency  of  Algebraic  Bayes’  Networks:  Background  Knowledge 

5.6.1.  Internal  Consistency  of  Algebraic  Bayes'  Network 

In  Definition  5.5  ABN  was  determined  as  structured  multitude  of  consistent  knowledge  pieces. 
It  can  be  shown  that  consistency  of  each  knowledge  piece  does  not  guarantee  consistency  of  ABN  in 
strict  sense.  Let  n  be  a  cardinality  of  the  multitude  of  propositions  Oq  =  {x,,X2,...,x„}  that  are 
arguments  of  ABN.  To  check  consistency  of  such  ABN  in  the  strict  probabilistic  sense,  it  should  be 
necessary  to  form  a  knowledge  piece  of  rank  n  and  to  check  its  consistency  in  the  same  way  which 
was  specified  above  for  knowledge  pieces  of  rank  2  and  3.  But  ABN  of  such  rank  consists  of 
{2" -if  elements  and  probabilistic  background  knowledge  imposes  3"-l  equations  or 
inequalities.  Even  for  relatively  low  value  of  n  the  tasks  of  checking  consistency  (checking  the 
existence  of  a  solution  of  the  corresponding  inequalities)  is  too  complex..  It  is  clear  that  such  size  of 
constraint  satisfaction  task  is  too  high  to  be  used  in  practice.  However,  dimensions  of  real  life 
applications  may  be  much  more.  This  means  that  "direct  way"  of  consistency  checking  and 
maintaining  is  infeasible. 

On  the  other  hand,  as  a  rule,  expert's  probabilities  are  very  approximate  and  incomplete  and  this 
is  a  justification  for  use  of  more  weak  condition  for  checking  and  maintaining  consistency.  To  assess 
consistency,  we  propose  to  utilize  a  notion  of  linearly  ordered  set  of  ABN  consistency  degrees. 

Definition  5.6.  Algebraic  Bayes'  Network  is  locally  consistent  if  and  only  if  all  its  knowledge 
pieces  are  consistent.  ■ 

Local  consistency  of  ABN  is  the  minimal  (the  weakest)  degree  of  its  consistency.  I  should  be 
noted  that  local  consistency  is  provided  by  Definition  5.5  of  ABN.  It  is  obvious  that  local  consistency 
is  a  necessary  consistency  condition  of  ABN. 

It  was  mentioned  above  that  different  knowledge  pieces  may  contain  common  conjunctions.  For 
example,  ABN  depicted  in  the  fig.5.4  contain  conjunctions  ,  x^,  ,  x^ ,  Xj ,  x^  that  are 

included  in  several  knowledge  pieces.  To  be  locally  consistent,  such  conjunctions  can  be  assigned  by 
distinct  values  of  truth  probabilities  in  different  knowledge  pieces  because  experts  can  provide  a 
contradictory  information  and  different  knowledge  pieces  can  be  formed  by  experts  of  different 
experience.  Within  ABN  common  conjunctions  have  to  be  equal. 

Definition  5. 7.  Two  knowledge  pieces  of  an  ABN  are  called  coordinated  if  truth  probabilities  of  all 
common  conjunctions  within  these  knowledge  pieces  are  assigned  equal  values.  ■. 

Definition  5.8.  Locally  consistent  ABN  is  called  internally  consistent  if  and  only  if  all  its 
knowledge  pieces  are  coordinated.  I 


*  This  number  is  equal  to  the  total  number  of  conjunctions  of  length  no  more  then  n  without  empty  conjunction. 
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Algorithm  of  checking  and  maintenance  of  ABN  internal  consistency  is  relatively  simple.  Let 
us  describe  it  idea. 

Let  all  knowledge  pieces  be  enumerated  by  indexes  from  1  to  M,  where  M  is  a  total  number  of 
knowledge  pieces  in  ABN.  Let  be  a  multitude  of  all  knowledge  pieces.  Let  us  denote  each 
knowledge  piece  number  i  by  KP ,  i €1,2,. ..,M. 

Algorithm  of  maintenance  of  internal  consistency  of ABN: 

1.  In  the  multitude  Q  select  a  knowledge  piece  KP  ^  indexed  by  minimal  number  i  and  then 
select  all  knowledge  pieces  having  indexes  k>i,  and  constituting  nonempty  multitude  of 
common  conjunctions  together  with  KP  j  .  Let  us  denote  the  multitude  of  such  knowledge 
pieces  including  KP ,  as  jKP^,  ; 

2.  Form  a  list  of  common  conjunctions  Con^  of  the  multitude  and  for  each  conjunction 
C  j  €  Con  compute  intersections  of  intervals  of  truth  probability  assigned  to  each  conjunction 
within  all  knowledge  pieces  of  the  multitude  KP^  (This  operation  intend  to  make  all 
knowledge  pieces  coordinated).  If  all  above  intersections  are  nonempty  then  assign  the 
resulted  (coordinated)  intervals  to  the  corresponding  conjunctions.  Otherwise,  ABN  is  internally 
inconsistent; 

3.  Delete  knowledge  piece  KP ,  from  the  multitude  /2.  If  is  nonempty  then  go  to  the  step  2;. 

4.  Solve  tasks  of  local  consistency  maintenance  for  each  KP  ^  that  contains  at  least  one  interval  of 
truth  probability  assigned  a  new  value  in  the  step  2.  ■ 

After  running  steps  1  -  4  the  possible  outputs  can  be  as  follows: 

•  either  iwconjM’^e/JcyofABN  is  ascertained; 

•  either  coordination  of  all  knowledge  pieces  is  ascertained  and  therefore  ABN  internal 
consistency  is  reached; 

•  or  not  all  knowledge  pieces  turns  out  coordinated.  In  the  last  case  all  steps  1-4  have  to 
be  repeated  once  more. 

The  above  algorithm  converge  to  the  decision  under  search  because  all  intervals  are  changed 
monotonically  (they  may  become  only  more  narrow)  and  value  of  intervals  are  restricted  from  below 
by  empty  set. 

Example  5. 6.1. 

Let  us  demonstrate  the  algorithm  of  internal  consistency  maintenance  for  ABN  depicted  in  the 
fig.5.4  assigned  by  truth  values  given  in  the  fig.5.3  ([Gorodetski  et  al-  97]). 

The  first  run  through  the  steps  1-3. 

Consider  the  first  pair  of  knowledge  pieces,  i.e.  K^^^ (Xi,X2Xj)  and  K^^^(x2,X2X^).They 
have  three  common  conjunctions  but  only  one  is  needed  to  be  coordinated.  The  result  is  as 
follows: 

P(x2)^[0.6.0.7]. 

Two  other  common  conjimctions  preserve  their  truth  probability  intervals,  i.e. 

p*(x,)  B [0.9, 1.0],  P(X2X2)  €[0.5,07] . 

The  second  run  through  the  steps  1-3. 

Consider  the  second  pair  of  knowledge  pieces,  i.e.  K^^^ (X2,X2X^)  h  K^^^  (x^,x^)  .Yox^em 
one  interval  of  truth  probability  is  modified  and  takes  value 

p\x,)€{0.3,0.5]. 
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The  third  run  through  the  steps  1-3. 

Consider  the  pair  of  knowledge  pieces  K^^^(x4,Xs)  h  K^^^(Xs,xJ.  Their  common 
conjunction  x^  is  coordinated.  Resulting  value  of  its  interval  of  truth  probability  is  preserved; 

p(Xs)  e  [0,1.0] . 

Run  of  the  step  4. 

Run  procedures  of  local  consistency  maintenance  for  all  knowledge  pieces  of  ABN.  The  final 
result  is  as  follows  ([Gorodetski  et  al-97]): 


K^^^(xj,X2,X3): 

p(Xj)  G [ 0.5,10] ,  p(x2)g[ 0.6, 0.7] , 
pfx^)  e[0.9,1.0],  p(xjX2)^[0.1,0.7], 
p(x,x, )  6  [0.4,10] ,  p(x2X, )  G  [ 0.5, 0.7] , 

p(x,X2X2)e[0,0.7] . 

K^^^(X4,Xs)'. 

p(xj  &  [0.3, 0.5] ,  p(Xj)  &  [0.5, 0.7] , 
p(x^Xj)e[0,0.5]. 


K^^^(X2,X2,X^) 

P(X2)  €[0.6,0.7]  ,  P(X2)  e[0.9,1.0], 
p(x^ )s[ 0.3, 0.5] ,  p( X2X2 )e[ 0.5. 0.7] , 
p(x2xj  e[0,0.2],  p(x3xj  e[0.3,0.5] , 

P(X2X3XJ  G  [0,0.2] . 

K^^^(Xj,Xg) 

p(x,)  e[0.5,0.7],  p(xje[0.3,0.5], 
P(XsX6)e  [0,0.5]. 


After  running  the  step  4  of  algorithm  all  conjunctions  are  assigned  by  coordinated  values  of 
truth  probabilities.  Therefore,  the  resulting  ABN  is  internally  consistent.  ■ 

The  above  algorithm  does  not  cover  the  case  when  there  exist  a  number  of  order  relations  over 
truth  probabilities  of  conjunctions.  This  case  was  considered  in  [IR-98]  and  we  demonstrate  it 
numerically  below  within  the  case  study  related  to  application  of  ABN  to  the  carburetor  of  a  car 
engine  diagnosis  in  Subsection  5. 7. 


Unfortunately,  internal  consistency  of  ABN  doesn't  guarantee  the  strict  consistency  of  the  latter. 
Internal  consistency  condition  is  only  necessary  but  not  sufficient.  Let  us  consider  example 
demonstrating  the  case  when  ABN  is  internally  consistent  but  not  consistent  in  the  strict  sense. 


Example  5.6.2. 

Let  us  consider  ABN  depicted  in  the  fig.5.5  that  consists  of  three  knowledge  pieces  of  rank  2 
([Gorodetski  et  al-97])  and  which  conjunctions  are  assigned  truth  value  intervals  as  follows: 

p(Xj)e[0.5,1.0],  p(x2)  =  0.5,  p(x3)  =  0.5,  p(XiX2)  =  0.5 ,  p(x,Xj)  =  0.5,  = 


P(XjX2)  p(x,X2)  P(X2X3) 


Fig.5.5.  An  example  of  internally  consistent  ABN  that 
is  inconsistent  in  strict  sense 

It  can  be  shown  that  this  ABN  is  inconsistent.  In  accordance  with  Definition  5.10  this  ABN  is 
internally  consistent.  However,  in  this  ABN  there  does  not  exists  any  consistent  assignment  for  truth 
probability  of  conjunction  x 3X2X3  that  meets  the  probabilistic  axioms  imposed  on  this  conjunction, 

i.e.  interval  of  truth  probability  pix  3X3X3 )  of  this  ABN  is  empty.  ■ 
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5.6.2.  External  consistency 

Let  us  introduce  one  more  degree  of  ABN  consistency  that  corresponds  to  the  next  degree  of 
consistency  compared  to  the  both  local  and  internal  degrees  of  ABN  consistency. 

Definition  5.10.  Internally  consistent  ABN  is  called  externally  consistent  if  assignment  of  truth 
probabilities  of  all  its  maximal  elements  is  consistent.  ■ 

Let  us  first  explain  new  terms  used  in  the  Definition  5.10.  In  Definition  5.10  a  notion  of 
maximal  element  of  ABN  is  introduced.  This  notion  is  used  in  the  lattice  theory  sense.  A  Basse 
diagram  ([Birkhof-67])  of  ABN  that  is  used  for  graphical  representation  of  the  latter  is  a  semi-lattice 
and,  saying  informally,  an  element  of  ABN  is  the  maximal  one  if  it  corresponds  to  a  conjunction  of 
maximal  length  within  a  knowledge  piece  to  which  it  belongs.  It  should  be  noted  that  a  maximal 
element  can  belong  to  an  only  knowledge  piece  and  each  knowledge  piece  may  contain  only  maximal 
element  ([Birkhof-67]).  For  example,  in  ABN  depicted  in  5.4  maximal  elements  are  ,  x^x^x^, 

and  x^Xg .  It  is  obvious  that  the  maximal  element  of  a  knowledge  piece  determine  uniquely  the 
structure  of  the  latter. 

Let  us  now  explain  the  term  "externally  consistent  assignments  of  truth  probabilities  of  all 
maximal  elements”.  Each  maximal  element  and  corresponding  knowledge  piece  may  be  considered  as 
contained  in  a  knowledge  piece  of  a  higher  rank.  For  example,  knowledge  pieces  K^^^(Xi,X2) , 
K^^^(Xj,X3)  and  (X2,X2)  that  form  ABN  depicted  on  the  fig.5.4  may  be  considered  as 
contained  within  knowledge  piece  K^^^(Xj,X2,X2)  of  rank  3  depicted  in  the  fig.5.5.  Knowledge 
pieces  depicted  in  the  fig.S.Sa,  5.3.6  may  be  considered  as  contained  within  the  knowledge  piece  of 
rank  4  depicted  in  the  fig.5.7.  In  the  both  above  mentioned  figures  dotted  lines  correspond  to  the  edges 
of  Basse  diagram  that  are  not  contained  in  the  original  knowledge  pieces. 

Let  us  consider  an  example  to  demonstrate  the  basic  idea  of  notion  of  external  consistency. 

Example  5.6.3. 

Let  us  consider  ABN  containing  a  multitude  of  knowledge  pieces  depicted  in  the  fig.  5.5  and 
marked  by  unbroken  lines.  This  ABN  is  internally  consistent  and  truth  probabilities  of  its  conjunctions 
are  as  follows: 

p(Xj)e[0.5,0.6],  p(x2)s[0.5.0.6],  p(x2)  e[0:5,0.6] , 

(5.16) 

P(XjX2)g[0.5,0.6] ,  p(x,X3)e [0,0.5],  p(x2Xj)  b[0.4,0.6] . 


In  the  original  ABN  a  truth  probability  of  the  conjunction  is  absent  but  it  has  to  meet 

constraints  imposed  by  background  probabilistic  knowledge  (5.2)  -  (5.6)  and  (5.8)-(5.14).  It  should  be 
noted  that  a  subset  of  these  constraints  has  been  already  met  in  the  original  AlBN  because  the  latter  is 
internally  consistent.  If  intervals  of  some  truth  probabilities  were  narrowed,  then,  nevertheless,  already 

met  constraints  would  not  be  broken.  For  example  (see  fig. 


p(XjX2X3) 


P(Xi)  P(X2)  p(X3) 


5.6)  the  probabilities  p(x,X2),  p(x,X2),  p(x2X3)  are 
consistent  internally  within  corresponding  knowledge 
pieces  of  rank  2.  To  meet  the  external  consistency 
conditions,  the  following  constraints  have  been  held: 


P(X2Xs)-P(XjX2X2)>0,  (5.17) 

p(XjX2)-p(XjX2Xj)>0,  (5.18) 

p(XiX2)-p(x,X2X3)>0,  (5.19) 

p( X3 ) -p(x,X3)^  p( X2 X3 )  +P(X2X2X3)  >  0 ,  (5.20) 


p(  X2)-p(  XiX2)-p( X2X3)+p(  X 1X2X3  )>0,  (5.21) 


Fig.5.6.Knowledge  piece  of  rank  3  that 
contains  maximal  elements  of 
knowledge  pieces  of  rank  2 


P(Xi)-p(XiX2)-p(XiX3)+p(XiX2X3)>0,  (5.22) 

i-p(x,)-p(X2)-p(X3)  +  p(XjX2)+p(XjX3)  + 

+  p(X2X3)-p(x,X2X3)>0.  (5.23) 
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This  task  is  simple  enough.  It  results  in  externally  consistent  solution  that  differs  from  the 
original  internally  consistent  ABN  only  in  the  estimation  of  truth  probability  of  the  conjimction  XjX^ 
that  is  p(  XjXj )  e[  03,0.5  ]  .■ 

Thus,  a  task  of  checking  and  maintenance  of  external  consistency  of  an  ABN  is  reduced  to  the 
checking  and  maintenance  consistency  of  a  knowledge  piece  of  minimal  rank  that  contains 
corresponding  maximal  elements  of  ABN. 

Let  us  consider  the  task  in  a  more  formal  way. 

Definition  5.12.  Knowledge  piece  of  minimal  rank  that  contains  maximal  elements  of  a  given 

p(XiX  2X3X4) 


Fig.5.7.  Knowledge  piece  of  rank  4  that  contains  maximal 
elements  of  knowledge  pieces  of  rank  3 

multitude  of  knowledge  pieces  K=  {K^‘^  }f,,,  is  called  minimal  external  knowledge  piece  for  the 
multitude  K .  ■ 

In  fig. 5. 6  and  fig5.7  the  minimal  external  knowledge  pieces  of  rank  3  and  4  respectively  are 
depicted. 

Let  us  remind  that  we  consider  ABN  that  contains  knowledge  pieces  of  rank  of  no  more  then 
3.  Therefore,  we  can  restrict  ourselves  by  the  following  particular  cases: 

1.  External  consistency  of  a  subset  of  knowledge  pieces  of  rank  2  contained  in  a  minimal  external 
knowledge  piece  of  rank  3.  It  is  clear  that  every  minimal  external  knowledge  piece  may  contain 
no  more  than  3  knowledge  pieces  of  rank  2; 

2.  External  consistency  of  a  subset  of  knowledge  pieces  of  rank  3  contained  in  a  minimal  external 
knowledge  piece  of  rank  4.  In  this  case  a  minimal  external  knowledge  piece  may  contain  two, 
three  or  four  knowledge  pieces  of  rank  3; 

3.  External  consistency  of  a  subset  of  knowledge  pieces  that  contains  knowledge  pieces  either  rank 
2  or  rank  3.  The  number  of  variants  of  knowledge  pieces  in  this  case  is  more  than  in  the 
previous  two  ones. 

Let  us  consider  above  listed  special  cases. 
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1.  External  consistency  of  ABN  that  contains  knowledge  pieces  of  ranks  no  more  than  2 

In  this  case  the  task  is  reduced  to  one  that  deals  only  with  internal  consistency  of  knowledge 
pieces  of  rank  3.  Algorithm  of  the  task  solving  was  demonstrated  in  the  example  5.6.3  and  consists  in 
meeting  constraints  (5.17)  -  (5.23).  In  principal,  the  set  of  constraints  (5.17)-  (5.23)  can  be  simplified 
if  there  is  no  necessity  to  estimate  intervals  of  truth  probabilities  of  conjunctions  that  are  not  contained 

in  the  original  ABN.  For  example,  if  ABN  consists  of  two  knowledge  pieces,  say,  K^^^(Xj,X2), 
K^^^(Xj,Xj)  (see  fig.5.6),  then  there  is  no  need  to  estimate  probabilities  p(x2Xj)  h 
This  task  is  slightly  simpler,  however,  in  such  approach  we  do  not  preserve  a  standard  form  of  task 
formulation.  To  preserve  it,  one  has  to  take  into  account  the  full  multitude  of  constraints. 

Figures  5.6  and  5.7  cover  all  special  cases  of  ABN  that  contains  knowledge  pieces  of  rank  of  no 
more  than  2. 

2.  External  consistency  of  ABN  that  contains  knowledge  pieces  of  rank  3 

Fig.5.7  makes  possible  to  enumerate  all  cases  that  may  be  met  while  dealing  with  external 
consistency  problem  solving  for  ABN  that  contains  knowledge  pieces  of  ranks  no  more  than  3.  An 
original  ABN  can  contain  any  pairs,  triples  of  knowledge  pieces  of  rank  3.  But  it  is  a  reason  to 
preserve  the  standard  formulation  of  a  the  set  of  constraints  and  does  not  distinct  particular  cases. 

Thus,  for  general  case  of  the  task  under  consideration  it  is  necessary  to  use  constraints  of  the 
multitude  which  are  presented  in4RP^«<^«:^5  of  the  Report. 

3.  External  consistency  of  a  subset  of  knowledge  pieces  that  contains  a  mixture  of  knowledge  pieces  of 
rank  2  and  3. 

This  case  is  reduced  to  the  two  previous  ones.  It  is  clear  that  if  ABN  contains  at  least  one 
knowledge  piece  of  rank  3  then  a  minimal  external  knowledge  piece  is  of  rank  4  what  allows  us  to 
reduce  this  case  to  the  task  considered  above. 

5.7.  Case  study  of  Experts’  Information  Processing:  Car  Engine  Diagnostics 

In  previous  subsections  of  this  section  we  considered  theoretical  basic  of  Algebraic  Bayes’ 
Networks  (ABN).  This  structure  was  developed  to  be  used  as  a  formal  framework  to  deal  with  expert’s 
information  to  design  knowledge  and  to  integrate  together  experts’  knowledge  and  other  one  which 
obtained  via  Data  Mining  and  Knowledge  Discoveiy  from  statistical  databases.  Peculiarity  of  expert’s 
information  and  problem  of  its  use  for  development  of  integrated  knowledge  base  were  considered  in 
detail  in  Interim  Report  [IR-98].  In  addition  the  problem  was  outlined  in  this  Report  in  Subsection  5.2. 
Below  we  describe  in  brief  and  demonstrate  technology  of  expert’s  information  processing  on  the 
basis  of  the  ABN  framework. 

Let  0Q  =  {Xj,X2,...,x„},  Xj  false, true}  be  a  multitude  of  propositions  that  formalize 
domain  factors,  for  example,  those  that  determine  status  of  performance  of  a  device  and 
F(X)=  F(x^^,Xi^,...,Xj^)  be  a  logic  formula  from  the  set  of  well  formed  formulae  given  over 

where  X=  }  —  is  a  subset  of  propositions .  It  was  justified  in  [IR-98]  that  if  we 

intend  to  extract  knowledge  from  an  expert  we  have  to  take  into  account  that  expert  is  able  to  reply 
only  on  relatively  simple  and  clear  questions  about  dependencies  existing  in  a  domain.  The 
dependencies  he/she  is  able  to  talk  about  are  usually  relations  given  over  no  more  than  two  or  at  best 
three  variables.  If  we  intend  to  use  ABN  as  the  formal  framework  for  uncertain  experts’  knowledge 
processing  and  representation  the  above  questions  are  about  experts’  assessment  of  probabilities  of 
occurrence  of  those  or  others  events  given  status  of  device  performance,  say,  “no  failure”  and 
“failure  ”. 

ABN  is  partially  ordered  set  of  positive  conjunctions  of  propositions  from  the  set  assigned 
by  probabilities  of  truth.  But  in  practice  experts  are  able  to  talk  not  only  about  probabilities  of  positive 
conjunctions  of  propositions  (complex  random  events).  Instead,  they  may  possess  knowledge  about 
probabilities  of  conjunction  containing  literals  with  and  without  negation,  about  probabilities  of 
disjunctions  that,  in  its  turn,  may  comprise  of  positive  and  negative  literals.  For  example,  if  we  ask  an 
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expert  about  eventual  probabilistic  dependencies  existing  over  two  statements  (factors,  features),  we 
may  await  replies  about  the  following  probabilities  [IR-98]: 

P(fl)  =  P(Xl),  P(f2)  =  P(X2)y  P(f3)  =  P(-^i)’  P(f4)  =  P(-^2)’ 

P(f5)  =  P(XlX2),  P(f6)  =  p(^l^2)^  P(f7)  =  P(Xl-^2)> 

P(f8)~P(~^I~^2)'  P(f9)  —  P(^I  ^^2),  p(fio)~P(~^l  ^^2)’ 

P(fu)  =  P(Xi^-^2)>  P(fl2)  =  P(^l  '^-^2)- 

It  is  clear  what  probabilities  we  may  deal  with  for  the  case  of  three-variables  dependencies. 

Therefore,  one  of  the  problem  of  expert’s  information  processing  is  how  to  obtain  ABN  model 
if  information  consists  of  a  number  of  experts’  statements  about  probabilities  of  formulae  given  over 
arbitrary  two-and  three-  proposition  logic  formulae.  The  corresponding  equations  and  algorithms 
were  considered  in  [IR-98].  In  Appendix  3  of  this  Report  they  are  given  again.  Below  we  outline 
corresponding  algorithms  and  demonstrate  technology  of  experts’  information  processing  aimed  at 
ANB  consistent  formal  model  design. 

5.7.1.  Experts’  Information:  Car  engine  carburetor  diagnostics. 

Let  we  intend  to  design  ABN  aims  at  solving  task  of  carburetor  of  a  car  engine  diagnostics.  We 
suppose  that  carburetor  status  can  take  two  values;  “no  failure”  and  “failure”.  Below  in  the  table  5.1 
in  the  column  #2  the  denotation  and  physical  sense  of  factors  observed  by  driver  are  given.  They  are 
given  as  the  statements  of  natural  language  in  quotation  marks.  Symbol  of  negation  “-1”  reflects  the 
fact  that  corresponding  statement  in  quotation  marks  is  false.  Columns  #3  and  #4  corresponds  to  the 
experts’  assessment  of  the  respective  interval  valued  probabilities  of  truth  of  the  statements  given  in 
the  column  #2  for  status  of  carburetor  “failure”  and  “no  failure”  respectively. 


Table  5.1.  Experts’  information  about  subject  domain  “Car  Engine  Diagnostics:  Carburetor”. 


No 

Denotation  of  experts  ’  expressions  on  natural 
language 

p(.f  failure 

=pm 

pij  /"no  failuri') 

=p(f/0) 

1 

Xj :  “Choking  of  cold  or  warmed  engine” 

[0.2-0.6] 

[0.0-0.3] 

2 

—CX, :  -i{“Choking  of  cold  or  warmed  engine”) 

[0.4-0.8] 

3 

X2 :  “Engine  shut  down  before  reaching  nominal 
temperature" 

[0-0.3] 

■1 

—1X2 :  -^{“Engine  shut  down  before  reaching  nominal 
temperature") 

[0.6-1 .0] 

5 

Xj :  “Difficult  firing  of  warmed  up  engine” 

[0.4-0.8] 

[0.2-0.4] 

6 

-1X3  :  -,(“Difficult  firing  of  warmed  up  engine”) 

[0.2-0.6] 

7 

X^ :  “  Unsteady  engine  run  at  no  load” 

[0.2-0.6] 

[0.0-0.05] 

8 

-iX^ :  -,i“Unstead!y  engine  run  at  no  load’) 

[0.3-0.7] 

9 

Xf :  “Jerking  movement  at  constant  speed’ 

[0. 1-0.4] 

[0.0-0. 1] 

10 

— iX^  :  -f" Jerking  movement  at  constant  speed") 

[0.6-0.9] 

11 

Xg :  “Jerking  acceleration" 

[0.2-0.5] 

[0. 1-0.3] 

12 

—iXg :  -{“Jerking  acceleration") 

[0.5-0.8] 

13 

Xj  :  “Low  acceleration  at  movement" 

[0.6-0.9] 

[0. 1-0.3] 

14 

—1X7 :  -{“Low  acceleration  at  movement") 

[0. 1-0.4] 

[0.2-0.6] 
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No 

Denotation  of  experts  ’  expressions  on  natural 
language 

P(f !"  foilure 

p(f  /"no  failuri') 
=p(f/0) 

15 

Xg :  “Highest  level  of  engine  power  is  not  achievable. 

Engine  troubles  at  highest  load” 

[0.5-0.8] 

16 

-CCg  :  ->(“ Highest  level  of  engine  power  is  not  achievable. 
Engine  troubles  at  highest  load”). 

[0.2-0.5] 

17 

Xg-.  “Running  of  engine  after  shut  down” 

[0.0-0.3] 

18 

—Xg  :  -^‘Running  of  engine  after  shut  down”) 

[0.7-1 .0]] 

19 

XjQ :  Sputtering  engine 

[0.0-0.3] 

20 

-Xjg  :-i”( Sputtering  engine”) 

[0.7-1 .0] 

21 

X,i :  Abnormally  high  level  of  fuel  consumption 

[0.6-0.9] 

22 

—Xjj :  -tf'Abnormally  high  level  of  fuel  consumption”) 

[0. 1-0.4] 

■ 

23 

Xg  Xjj 

[0.4-0.8] 

24 

Xg  Xg 

[0. 1-0.5] 

25 

XjXg 

[0. 1-0.5] 

26 

X?  Xg  Xjj 

[0.2-0.5] 

27 

X7  Xg 

[0.4-0.8] 

28 

X7XJI 

[0.4-0.8] 

29 

Xg  X7  Xg 

[0.0-0.4] 

30 

X6X7 

[0. 1-0.6] 

31 

Xg  X,  Xjj 

[0. 1-0.4] 

32 

XgXn 

[0. 1-0.5] 

33 

Xj  X, 

[0.2-0.6] 

34 

XjXs 

[0.1-0.6] 

35 

Xg  Xjj 

[0.3-0.7] 

Remark:  Empty  positions  of  the  tab.5.1  correspond  to  the  probability  interval  [0,1]. 

In  addition  to  the  data  given  in  tab.5.1  the  following  order  relations  given  over  probabilities  of 
truth  of  the  formulae  were  extracted  from  experts: 


pi^Xj )  >  2  X  p(^Xg  ),  24^ 

5xp(X^oX,,)>p(X,). 

Data  in  tab.5.1  and  order  relation  (5.24)  forms  experts’  information  that  has  to  be  taken  into 
account  for  knowledge  base  design. 

Let  us  note  that  below  we  demonstrate  technology  of  ABN  design  and  its  consistency 
maintenance  only  for  the  experts’  information  related  to  the  status  “failure”  of  device  “Carburetor”, 
because  this  case  is  much  more  complex  than  the  case  related  to  the  status  “no  failure  ”. 

The  demonstrated  below  technology  of  experts’  information  processing  to  design  knowledge 
base  and  corresponding  numerical  results  are  obtained  on  the  basis  of  the  developed  software.  The 
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software  was  developed  in  the  environment  Visual  C-H-  and  Access  97  Data  Base  Management 
System, 

5. 7.2.  Consistency  maintenance  Procedure:  Numerical  Results 

The  first  step  of  processing  of  the  data  given  in  the  tab.5.1  is  \jiowledge  pieces  extraction. 
Printout  of  this  step  result  is  depicted  in  the  fig.5.8.  In  addition  the  list  of  the  knowledge  pieces  of  rank 
1, 2  and  3  is  given  in  tab.5.2  in  the  second  and  third  columns.  It  can  be  seen  that  experts’  information 
leads  to  the  Algebraic  Bayes’  Network  comprising  10  knowledge  pieces. 

Next  step  according  to  the  developed  technology  corresponds  to  the  ABN  local  consistency 
maintenance  Aat  is  calculation  of  the  locally  consistent  probabilities  assigned  to  the  nodes  of  the 
ABN.  To  realize  this  step  it  is  necessary  to  use  equations  given  in  Appendix  A3.  These  equations  are 

labeled  as  for  knowledge  pieces  of  rank  2  and  as  EI^^-EJ^^  for  knowledge  piece  of 

rank  3.  Of  course,  it  is  necessary  to  use  only  those  of  them  that  correspond  to  the  experts’  information 
containing  in  the  tab.5.1.  One  can  see  that  in  our  case  there  is  no  necessity  to  use  equations  containing 
disjunctions  of  propositions. 

Equations  E/^^-E/^^  and  have  to  be  considered  as  constraints  for  pairs  of 

linear  programming  tasks  formulated  for  each  node  of  the  corresponding  knowledge  piece.  This 
procedure  was  described  in  detail  in  Interim  Report  [IR-98].  The  results  of  calculation  of  the  truth 
probabilities  of  ABN  nodes  are  given  in  the  fifth  column  of  tab.5.2.  It  should  be  noted  that  resulting 
ABN  is  locally  consistent. 

The  third  step  is  maintenance  of  the  ABN  internal  consistency.  The  algorithm  of  this  task 
solution  was  described  above  in  Subsection  5.6.  Let  us  remind  that  the  algorithm  aims  at  calculating  of 
the  coordinated  probabilities  for  the  nodes  which  belong  to  several  (more  than  to  one)  knowledge 
pieces.  The  results  of  calculations  are  presented  in  the  fifth  column  of  tab.5.2. 

The  last  step  is  checking  and  if  necessary  maintenance  of  the  external  consistency.  This 
procedure  was  described  in  Interim  Report  [IR-98]  and  outlined  in  Subsection  5.6  above.  Results  of 
implementation  of  the  procedure  are  given  in  the  sixth  column  of  tab.5.2. 


Table  5.2.  Knowledge  pieces  extracted  from  experts’  information  and  probabilities  assigned  to 
their  nodes  in  the  consequent  steps  of  experts’  information  processing:  Status  “failwe". 


# 

Knowledge 

piece 

maximal 

node 

Nodes 

Probability 
corresponding 
to  the  source 
data 

Probability 
corresponding 
to  the  local 
consistency 

Probability 
corresponding 
to  the  internal 
consistency 

Probability 
corresponding 
to  the  external 
consistency 

1 

^7 

[0.2;0.5] 

[0.2;0.5] 

[0.2;0.5] 

[0.2;0.5] 

X7 

[0.6;0.9] 

[0.6;0.9] 

[0.6;0.9] 

[0.6;0.9] 

[0.5;0.8] 

[0.5;0.8] 

[0.5;0.8] 

[0.5;0.8] 

MBBI  ’ 

[0.1;0.5] 

[0.1;0.5] 

[0.1;0.5] 

[0.1;0.5] 

[0.1;0.5] 

X7  Xg 

[0.4;0.8] 

[0.4;0.8] 

[0.4;0.8] 

[0.4;0.8] 

Xd  X7  Xg 

[0.0;0.4] 

[0.0;0.4] 

[0.0;0.4] 

[0.0;0.4] 

2 

X;  Xu 

^d 

[0.2;0.5] 

[0.2;0.5] 

[0.2;0.5] 

[0.2;0.5] 

X7 

[0.6;0.9] 

[0.6;0.9] 

[0.6;0.9] 

[0.6;0.9] 

Xil 

[0.6;0.9] 

[0.6;0.9] 

[0.6;0.9] 

[0.6;0.9] 

XeXn 

[0.1;0.5] 

[0.1;0.5] 

[0.1;0.5] 

[0.1;0.5] 
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# 

Knowledge 

Probability 

piece 

Nodes 

corresponding 

maximal 

to  the  source 

node 

data 

g 

c 

Probability 

orresponding 

c 

t( 

t( 

[0.4;0.8] 


[0.1;0.4] 


[0.6;0.9] 


[0.5;0.8] 


[0.6;0.9] 


[0.4;0.8] 


[0.4;0.8] 


[0.4;0.8] 


[0.2;0.5] 


[0.0;0.3] 


[0.6;0.9] 


{0.0, 1.0] 


{0.0,  l.O] 


{0.0, 1.0] 


{0.0, 1.0] 


[0.2;0.6] 


[0.4;0.8] 


[0.1;0.5] 


[0.2;0.6] 


[0.4;0.8] 


[0.1;0.4] 


[0.6;0.9] 


[0.5;0.8] 


[0.6;0.9] 


[0.4;0.8] 


[0.4;0.8] 


[0.4;0.8] 


[0.2;0.5] 


[0.4;0.8] 


[0.1;0.4] 


[0.6;0.9] 


[0.5;0.8] 


[0.6;0.9] 


[0.4;0.8] 


[0.4;0.8] 


[0.4;0.8] 


[0.2;0.5] 


[0.35;0.6] 


[0.07;0.3] 


[0.0;0.25] 


[0.2;0.6] 


[0.4;0.8] 


[0.1;0.5] 


[0.2;0.6] 


[0.07;0.3] 


[0.0;0.25] 


[0.2;0.6] 


[0.4;0.8] 


[0.1;0.5] 


[0.2;0.6] 


[0.2;0.6] 

[0.2;0.6] 

[0.2;0.6] 

[0.2;0.6] 

[0.4;0.8] 

[0.4;0.8] 

[0.4;0.8] 

[0.4;0.8] 

[0.1;0.4] 

[0.1;0.4] 

[0.1;0.4] 

[0.1;0.4] 

[0.1;0.6] 

[0.1;0.4] 

[0.1;0.4] 

[0.1;0.4] 

[0.4;0.8] 

[0.4;0.8] 

[0.4;0.8] 

[0.4;0.8] 

[0.6;0.9] 

[0.6;0.9] 

[0.6;0.9] 

[0.6;0.9] 

[0.3;0.7] 

[0.3;0.7] 

[0.3;0.7] 

[0.3;0.7] 

[0.6;0.9] 

[0.6;0.9] 

[0.6;0.9] 

[0.6;0.9] 

[0.0;0.3] 

[0.0;0.3] 

[0.0;0.3] 

[0.0;0.3] 

[0.0;0.3] 

[0.0;0.3] 

[0.0;0.3] 

[0.0;0.3] 
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|l  Experts*  and  formal  knowledge  processing. 


'rPiodicale* - 

- 1  Formulae 

||  |®| 

!!  nobabiUy 

N0T(X1 ) 

[0.40 

0.80] 

U 

NOT(XIO) 

[0.70 

1.00] 

U 

N0T(X11 ) 

[0.10 

0.40] 

U 

N0T(X2) 

[0.60 

1.00] 

U 

N0T(X3j 

[020 

0.60] 

U 

N0T(X4) 

[0.30 

0.70] 

U 

N0T(X5) 

[060 

0.80] 

U 

N0T(X6) 

[0.50 

0.80] 

U 

N0T(X7) 

[0.10 

0.40] 

U 

NOT(Xe) 

[0.20 

0.50] 

U 

N0T(X9) 

[0.70 

1.00] 

U 

XI 

[0.20 

0.60] 

U 

X1ANDX3 

[0.10 

0.50] 

U 

X1ANDX4 

[0.20 

0.60] 

U 

X10 

[0.00 

0.30] 

U 

XII 

[0.60 

0.90] 

U: 

XII  AND  X3 

[0.30 

0.70] 

U 

XII  AND  X6 

[0.10 

0.50] 

U 

X11ANDX6ANDX7 

[0.10 

0.40] 

U 

XII  AND  X7 
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Fig.5.8.  Printouts  of  experts’  source  information  and  of  results  of 
extraction  knowledge  nieces  from  exoerts’  information 
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In  tab.5.2  the  lines  of  gray  color  corresponds  to  the  probabilities  which  values  are  changed 
compared  to  the  information  extracted  from  experts. 

5.8.  Consistent  integration  of  statistical  and  expert  information  within  diagnostic  model 

Algebraic  Bayes’  Network  is  a  structure  for  representation  of  any  information  under 
uncertainty.  In  many  practically  interesting  cases  such  a  way  of  uncertain  knowledge  specification  is 
advantageous.  Therefore  expert  information  is  not  only  type  of  information  which  be  represented 
consistently  in  the  frameworks  of  ABN.  It  was  mentioned  in  Section  4  that  knowledge  extracted  from 
statistical  data  contains  a  lot  of  sources  of  uncertainty  as  well.  In  fact,  frequently  we  have  sampling  of 
statistical  data  of  very  small  size,  in  particular,  within  applications  like  new  hardware  diagnostic 
model  design  that  is  the  subject  of  this  Project.  As  a  rule  probabilistic  measure  of  any  statement 
calculated  on  the  basis  of  statistical  database  and/or  extracted  from  expert  can  be  estimated  only  as 
confidence  interval  because  point-wise  estimations  of  a  probability  in  the  most  cases  is  very 
inaccuracy  and  um-eliable. 

Knowledge  extracted  from  statistical  data  in  a  form  of  logic  formulae  assigned  interval 
probabilities  of  truth  (they  were  called  classifying  predicates  in  Section  4)  can  be  represented  in  the 
framework  of  Algebraic  Bayes'  Networks  like  it  was  described  in  this  section  for  expert  information. 
Hence,  all  advantages  of  ABN-based  representation  and  processing  of  expert  information  are  valid  for 
knowledge  extracted  from  statistical  data.  The  main  advantage  of  ABN  is  that  statistically  estimated 
confidence  intervals  combined  with  experts’  assessments  of  interval  probabilities  of  correlating 
variables  (statements,  features,  factors)  may  be  narrowed  remarkably  what  corresponds  to  more  high 
accuracy  of  a  target  diagnostic  model.  A  conclusion  is  that  ABN  is  a  convenient  framework  to  join 
knowledge  extracted  from  experts’  information  and  one  assessed  over  statistical  data. 

Applied  to  the  new  hardware  diagnostic  model  development  the  above  advantage  of  ABN 
formal  model  plays  a  very  significant  role.  The  reason  to  ascertain  its  benefit  in  such  application  is 
obvious:  insufficiency  of  statistical  data  and  insufficiency  and  uncertainty  of  expert  information.  To  be 
integrated  together  in  a  consistent  way  within  the  framework  of  ABN,  they  are  able  to  improve 
significantly  diagnostic  model. 
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6.1.  Contribution  of  the  research 

The  research  presented  in  this  report  is  aimed  at  development  of  mathematical  models 
associated  with  the  technology  for  information  based  health  assessment  system  and  numerical 
verification  of  these  models.  Up  to  the  time  when  the  real  statistical  database  is  accumulated 
we  need  mathematical  model  to  generate  adequate  database  that  makes  it  possible  to  verify 
developed  algorithms  and  corresponding  technology  and  to  do  further  research  aimed  at 
specializing  algorithms  for  new  concrete  type  of  device.  Of  course,  each  new  device  will 
require  to  develop  “ad  hoc”  model.  Nevertheless,  the  general  principles  and  ideas  of 
development  task  related  model  may  be  borrowed  from  the  Dynamic  Data  model  developed 
in  this  research  and  presented  in  Section  2. 

However  the  major  task  of  this  research  is  development  of  technology  for  the  accurate 
assessment  of  the  probability  of  failure  of  hardware,  such  as  avionics,  on  the  basis  of  its 
known  «history  of  abuse»  by  environmental  and  operational  factors  and  assessment  of  the 
residual  performance  resource.  The  successful  solution  of  both  problems  allows  us  to  forecast 
the  probability  of  failure  during  a  forthcoming  sortie  and  to  assess  the  residual  performance 
resource  thus  providing  a  quantitative  basis  for  mission  planning  and  timely  maintenance  as 
well  as  preventing  emergencies.  This  application  cannot  be  regarded  as  a  conventional 
reliability  problem  because  classical  reliability  does  not  view  exposure  to  specific 
environmental  conditions  and  operational  factors  as  a  main  cause  of  failures.  The  problem 
stated  herein  does  not  constitute  a  conventional  prognostic  task  also  because  the  failure  may 
not  occur  at  all.  The  problem  statement  considered  in  the  Report  was  for  the  first  formulated 
in  the  paper  [Skormin  et  al  -97].  Such  a  problem  statement  is  prompted  by  the  modem 
concept  of  maintenance  known  as  the  «service  when  needed». 

It  is  expected  that  the  prognostic  model  presented  in  this  Report  is  developed  on  the 
basis  of  information  downloaded  from  dedicated  monitoring  systems  of  flight-critical 
hardware  and  stored  in  a  database.  Therefore,  the  stated  problem  is  related  to  the  area  of  tasks 
of  Data  Mining  and  KDD  ([Frawly  et  al  -91],  [Matheus  et  al  93],  [Fayyad  et  al-95-1],  [Fayyad 
et  al  95-2],  [Bradley  et  al-98-1]).  According  to  the  existing  topics  of  Data  Mining  prognostic 
model  design  is  a  classification  problem  ([Fayyad  et  al  95-1]).  Classification  problem  of  such 
kind  is  well  known  and  is  being  investigated  at  least  during  four  decades  ([Fukunaga-72], 
[Patrick-72],  [Tou  et  al  -74],  [Ryin-76]). 

Nevertheless,  a  number  of  principle  tasks  of  classification  are  still  of  great  interest  and 
deserve  further  investigation.  For  example,  a  hot  area  of  classification  is  the  so-called 
problem  of  feature  informativity  and  algorithms  of  their  selection  as  well  as  methods  and 
algorithms  of  learning  for  synthesis  of  classification  rule  [Bradley  et  al  -98-2].  In  addition, 
there  exist  a  number  of  problems  very  important  from  the  applications  point  of  view  that  still 
do  not  have  efficient  solutions.  For  example,  development  of  classification  models  based  on 
Data  Mining  and  KDD  for  the  case  when  databases  contain  columns  measured  on  both 
continuous  and  discrete  scales. 

Let  us  summarize  the  new  results  presented  in  this  research  and  described  in  [IR-98] 
and  in  this  Report  that  constitute  its  main  contribution  to  the  applied  classification  problem 
solving  and  to  the  area  of  Data  Mining  and  KDD. 

1.  The  basis  of  the  classification  model  proposed  in  the  Report  is  formed  by  so-called 
classification  predicates.  Firstly  the  idea  was  proposed  by  V.  Skormin  and  L.  Popyack  in 
[Skormin  et  al  -97].  They  are  associated  with  subspaces  of  factors  of  low  dimension,  in 
particular,  with  2-d  subspaces.  Classification  predicates  are  defined  over  the  entire  factor 
space.  Each  classification  predicate  divides  the  latter  into  two  regions  according  its  truth 
values  {«tnie»  and  «false»)  in  such  way  that  each  region  contains  mostly  realizations  of  one 
of  two  clusters  of  data.  A  classification  predicate  is  true  within  a  region  of  the  factor  space 
bounded  by  a  set  of  separation  functions  of  two  arguments,  which  are  particular  components 
of  the  entire  factor  space.  For  a  2-d  separation  bound,  the  efficient  procedure  resulting  in 
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optimal  separation  functions  of  arbitraiy  shape  (including  non-  convex  case)  and  associated 
classification  predicates  is  investigated.  This  procedure  is  based  on  the  visualization  of 
cluster  projections  onto  arbitrary  2-d  subspaces,  and  is  implemented  in  an  interactive 
software  tool  developed  in  the  framework  of  this  research.  This  procedure  provides  a  user 
an  opportunity  to  draw  any  separation  rule  manually  utilizing  its  approximation  by  a 
polygon,  i.e.  an  arbitrary  linear  spline.  Moreover,  the  regions  established  by  this  procedure 
could  be  many  -  connected  and  non  -  convex.  A  user  is  required  to  draw  a  separation  bound 
while  the  software  tool  generates  the  associated  classification  predicate  automatically. 

2.  A  decision  tree-like  model  of  the  classification  procedure  is  proposed  and  implemented  in 
numerically  efficient  software.  The  peculiarity  of  decision  trees  presented  in  Interim  Report 
[IR-98]  in  and  this  Report  is  that  it  is  binary  and  consists  of  a  number  of  ranked 
classification  predicates.  Each  predicate  is  associated  with  a  node  of  the  tree  and  subsets  of 
database  realizations  that  belong  to  the  region  of  factor  space  where  corresponding 
classification  predicate  is  «true».  The  above  regions  of  factor  space  and  subsets  of 
realizations  are  ranked.  The  set  of  regions  and  corresponding  subsets  of  realizations 
associated  with  the  leaves  of  a  decision  tree  are  not  overlapping  and  their  combination 
covers  the  entire  factor  space  and  the  set  of  realizations  respectively.  The  last  property 
provides  an  opportunity  to  introduce,  in  a  natural  way,  a  set  of  elementary  events  and  the 
corresponding  probabilistic  space  that  constitutes  a  model  for  the  assessment  of  the 
probability  of  failure  of  hardware,  i.e.  to  solve  the  target  task.  In  addition,  the  notion  of  meta 
-  tree  used  in  this  research  and  described  in  section  4  makes  it  possible  to  solve 
classification  task  for  arbitrary  number  of  clusters  of  data  without  any  change  of  the 
technology  of  the  above  decision  tree  design. 

3.  The  research  presented  in  this  paper  is  aimed  at  designing  a  model  for  reliable  assessment 
of  the  probability  of  failure.  A  pure  statistical  approach  in  the  case  of  a  small  amount  of 
training  and  testing  data  is  not  sufficient  for  providing  the  necessary  accuracy  and  reliability 
of  failure  prognosis.  Therefore,  this  paper  suggests  utilizing  a  number  of  different  decision 
trees  that  are  supposed  to  be  used  to  form  a  more  accurate  collective  decision.  Each  decision 
tree  consists  of  a  number  of  ranked  classification  predicates  associated  with  2-d  subspaces 
of  factors.  The  most  important  requirement  is  that  each  decision  tree  has  to  be  associated 
with  different  subspaces  of  factors.  Information  redundancy  is  a  reason  for  a  possible 
accuracy  enhancement  of  the  assessment  of  the  probability  of  failure.  To  employ  the  idea  of 
redundancy,  a  special  procedure  of  joint  processing  of  the  decisions  obtained  by  individual 
decision  trees  is  investigated.  It  is  based  on  the  concept  of  so-called  «Algebraic  Bayes’ 
Networks))  developed  by  the  author  ([Gorodetski-92],  [Gorodetski  et  al  -97]).  In  fact, 
estimation  enhancement  is  achieved  due  to  utilizing  the  background  knowledge.  This 
method  is  demonstrated  numerically.  Additional  utilization  of  interval  mathematics  methods 
to  calculate  the  posterior  probability  of  failure  on  the  basis  of  Bayes’  formula  makes  it 
possible  to  obtain  the  precise  upper  (minimal)  bound  of  the  target  probability. 

It  should  be  noted  that  the  classification  rule  development  technique  presented  could  be 
efficiently  used  in  a  wide  area  of  applied  tasks  of  Data  Mining  and  KDD. 

4.  The  utilization  of  the  concept  of  a  classification  predicate  defined  over  subspaces  of  low 
dimensions  makes  possible  to  develop  a  totally  new  approach  to  the  task  of  rules  extraction 
from  databases  in  the  most  complex  case.  One  such  case  is  the  situation  when  a  database 
contains  columns  specified  both  in  continuous  and  discrete  scales.  It  is  well  known  that  now 
this  task  is  one  of  the  key  problems  of  Data  Mining  and  KDD.  Any  known  and 
conventionally  used  approach  aiming  at  the  creation  of  a  knowledge  base  by  «mining))  a 
database  containing  both  continuous  and  discrete  data  is  based  on  direct  discretization  of 
continuous  (real  valued)  data  and  results  in  substantial  dimension  increase  of  the  factor 
space.  Consequently,  such  an  approach  leads  to  inefficient  algorithms  and  can  not  be 
recognized  as  a  satisfactory  one  even  if  a  discretiation  is  made  optimally. 

As  an  alternative,  an  approach  based  on  the  utilization  of  the  concept  of  classification 
predicate  makes  it  possible  to  avoid  artificial  discretization  at  all.  Actually,  a  classification 
predicate  itself  defined  over  a  subset  of  continuous  factors  (features)  can  be  considered  as  a 
discrete  specification  of  continuous  data.  A  classification  predicate  can  be  considered  as  a 
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new  feature  which  represents  the  same  data  in  a  new  way.  There  exist  a  number  of  known 
approaches  to  cope  with  the  task  of  extraction  rules  from  database  with  columns  specified  in 
discrete  scales  [see,  for  example,  [Michalski  et  al  -81],  [Quinlian-83],  [Michalski-90]).  Hence, 
the  concept  of  classification  predicate  makes  it  possible  to  solve  a  number  of  difficult  Data 
Mining  and  KDD  problems  in  an  efficient  new  way. 

One  more  original  approach  to  solve  the  task  of  rule  extraction  from  learning  data  was 
proposed  by  author  of  this  Report  [Gorodetski  et  al  -96].  It  was  described  in  brief  in  Appendix 
of  the  Interim  Report  [IR-98]. 

5.  Algebraic  Bayes’  Networks  theory  developed  by  author  and  presented  in  Interim  Report 
[IR-98]  and  in  this  one  possess  a  number  of  advantages  regarding  to  the  application  that  is 
the  subject  of  this  research  and  regarding  to  a  more  wide  area  of  Knowledge  Engineering. 
The  area  of  its  applications  is  dealing  with  uncertain  and  sub-defined  data  including 
expert’s  knowledge. 

6.2.  Proposals  for  future  research 

This  research  may  be  considered  as  a  step  in  the  development  of  technology  for 
information-based  health  assessment  system  design.  Of  course  it  is  not  able  to  solve  a  number 
of  problems  associated  with  this  very  important  and  difficult  task.  However,  there  may  be 
pointed  out  a  number  of  theoretical  and  applied  problems  that  in  my  opinion  has  to  be  a 
subject  of  research  in  the  framework  of  health  assessment  system  design.  ITiey  are  as  follows: 

1.  Mathematical  model  for  mining  knowledge  from  database  of  multi-scale  data 
structures.  In  compare  with  the  model  developed  in  this  research,  the  proposed  research 
aims  at  development  a  mathematical  model  of  technology  which  integrates  (1) 
processing  real  valued  database  resulting  in  obtaining  classification  predicates  and  (2) 
processing  discrete  valued  data  resulting  in  extraction  rules  from  both  real  valued  and 
discrete  valued  data.  This  research  may  aim  at  development  a  mathematical  basis  for 
advanced  data  mining  technology  applicable  for  wide  area  of  information-based  health 
assessment  systems. 

2.  Development  of  software  tool  prototype  aimed  at  supporting  an  interactive  technology 
of  information-based  health  assessment  system.  Development  of  such  tool  could  make 
it  possible  to  investigate  numerically  the  pros  and  cons  of  any  mathematical  basis,  its 
advantages  and  deficiencies.  This  tool  may  be  used  to  prepare  future  developers  to 
implement  technology.  It  might  be  a  first  step  to  development  of  powerful  multi¬ 
purpose  software  tool  for  utilization  in  the  area  of  information  -based  health 
assessment  systems  design. 

3 .  Advanced  statistical  and  logical  models  for  extracting  sensitive  patterns  from  database. 
This  mathematical  model  aims  at  solving  such  practically  important  prognostic  related 
tasks  as: 

•  ranking  particular  environmental  conditions  as  factors  responsible  for  general  and 
particular  types  of  failures, 

•  determination  of  particular  groups  of  environmental  conditions  (’’patterns”)  and 
assessment  of  their  combined  effects  on  failures  in  general  and  on  particular  types  of 
failures, 

•  justification  of  the  development  of  devices  protecting  from  adverse  environmental 
conditions, 

•  development  of  the  recommendations  on  the  avoidance  of  the  combined  effects  of 
adverse  conditions. 

Conventionally  these  tasks  are  solved  by  methods  of  mathematical  statistics 
which  uses  ideas  from  component  and  factor  analyses.  But  the  latter  are  not  appropriate 
in  many  practical  situations.  In  addition,  in  these  tasks  it  is  necessary  to  deal  with 
continuous  and  discrete  factors  what  takes  to  develop  a  more  powerful  mathematically 
justified  approach.  It  should  be  noted  that  this  problem  now  is  the  subject  of  intensive 
research  in  Data  Mining  area. 
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Appendix  Al.  Trajectories  of  failure  development 


Fig.  A  1.4.  Realization  of  trajectories  of  development  of  adverse 
exposure  X/p  as  the  functions  of  the  number  of  aircraft  sortie 
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Fig.Al.6.  Realization  of  trajectories  of  development  of  adverse 
exposure  Xjo  as  the  functions  of  residual  performance  resource 
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Fig.Al.7.  Realization  of  trajectories  of  development  of  adverse 
exposure  Xigas  the  functions  of  residual  performance  resource 


Fig.Al  .8.  Realization  of  trajectories  of  development  of  adverse 
exposure  Xj^as  the  functions  of  residual  performance  resource 
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Results  of  testing  decision  trees  and  results  of  testing  of  voting  procedures 
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Appendix  A3.  Algebraic  Bayes’  Network  related  equations 

A3.1.  Equations  over  probabilities  of  logic  formulae  for  knowledge  pieces  of  rank  2  and 
3  used  to  process  experts’  information  to  design  locally  consistent  ABN 

Knowledge  piece  of  rank  2 

p(Xj)+p(^i)=l, 

Ej'^-  P(X2)  +  P(-<X2)=1’ 

E/'^:  p(x,)+p(x,)-p(XjX,)+p(^j-.X2)  =^- 

Ef^ .  p(x2)-p(XjX2)- p( -^iX^)  =  0 , 

p(Xj)-p(XjXj-p(Xj^2)  =  0, 

Ej"^:  p(XjVX2)  =  p(Xj)  +  p(x2)-p(XjXj, 

:  p(^jVX2)  =  l-p(Xj)  +  p(XjX2), 

Ej'^:  p(XjV^J  =  l-p(xJ  +  p(XjX2), 

E/"^:  p(-nXjW^2)  =  ^-P(^1^2)- 

Thus,  in  the  two-  proposition  case  only  three  unknowns  have  to  be  calculated.  To  transform 
experts’  data  to  the  ABN  structure  it  is  enough  to  select  as  unknowns  the  following  ones:  p(Xj), 

PCx^),  P(X2X2). 

Knowledge  piece  of  rank  3 

E/^^:  p(Xj)+p(^j)=l, 

Ej^^:  p(x2)  +  p(^2)^1> 

Ej^^:  p(x3)  +  p(-nXj)=l, 

Ej^^:  p(Xj)+p(x2)-p(XjX2)+p( -^1-^2) 

E/^^ :  p(Xj)  +p( xj-p( Xjxj+p( -^1^3)  =1, 

Ej'^-  P(X2)+P(X3)-P(X3X3)+P(^3-X3)=1, 

:  p(x,)+p(x2)+p(x3)-p( XjXj -p( x,X3 )-p( )+p( ;+ 

+  p(-^l-^2-^3)=l, 

Ej'^  ■■  p(x2)-p( XjXj  -  p( -ix.x J  =  0 , 

E/^^ :  p(Xj)-p( X;Xj - p( Xj^^)  =  0, 

Eid'^  ■  p(x3)-p(XjX3)-p(-iXjX3)  =  0, 

E,/^^:  p(Xi)- p(x,X3)- p(Xi^3)  =  0 , 

p(x3)-p(x2X3)-p(^,X3)  =  0, 

Ej^^-.  p(X2)- p(X3X3)- p(Xj^3)  =  0 , 

Ej^^ :  p( X2xJ-p( XjX^Xj  )-p( -^jX^x^ )  =0, 

Ei/^^:  p(x,xJ-p(XjX2X3)-p(Xj^2X3)=0, 

Ej^^ :  p( x,X2)-p( XjX2X3)-p( x^x^-nx J  =0, 

Ei/'^-.  P(X3)-p( X,X3)-p(X2X3)+p( XjX2X3)-p( -^J^2^3)=0, 

EtJ'^ :  P(X2)-P( XjX2)-p( X2X3)+p( XjX2X3)-p( 
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p(Xj)-p(XiX,)-p(XjX,)+p(XjX2X,)-p(Xj^,-^J=0, 

p(x,vx,)  =  p(xj  +  p(x,)-p(x,x,), 
p(-^i  vx,)  =  l-p(x,)  +  p(xjx,), 
p(xisy-^x^)  =  l-p(x,)  +  p(x^x,), 

Ej'^ :  p(-^i  v-nx,)  =  l-p( XjXj, 

£2/'^:  p(XjVx,)  =  p(x,)+p(x,)-p(x,xJ, 

£25'^ :  V(^1  V ^i)  =  +  P( 

£26'^'-  P(Xiy^3)  =  l-p(Xs)  +  P(XiXs), 

Ej'^-.  = 

£2J'^-  P(^2  ^X3)  =  P(^2)-^P(^3)-P(^2^s)> 

Ej'^ :  p(^2  '^x,)  =  l-p(x2)^p(x2x,), 

£30'^  ■  P(^2  '^^3)  =  ^-P(^s)  +  P(^2^3)^ 

£3/"^ ;  p(^2  ^^3)=i-P(^2^3)^ 

Ej'^ :  p(Xj  \fX2WX3)^p(x,)  +  p(x2)  +  p(x,)- 

-  p(x,X2)  -  p( X^Xj  )-p(X2Xj  +  p( XjX^Xj), 

£33'^ :  p(-<Xi  V  V  JC, ;  =l-p(Xi)  +p(  XjX^)  +  p(  XjXj  -  p(  XjX^x^ ) , 

£34^'^  ■  P(XiV^2^^3)  =J-P(X2)  +  P(XiX2)  +  P(X2X3)-P(XiX2X3), 

£35'^-  p(XjVX2^/^3)=l-p(X3)  +  p(x,xJ  +  p(X2X,)-p(X,X2X2), 

Ejg^^'-  p( ~~Xj  V  —X2  V  x^)  -i~p(XjX2)  +  p(XjX2X2) , 
p(^iVX2  V-Xj)=l-p(XjX2)  +  p(XjX2X2), 

£3/^^  ■  p(^i  V  V  ^ J  -p( X2X3)  +  p( Xj X2X2 ) , 

:  p(-^j  V  ^2  V  -^3)  =i-P( X1X2X3). 

Thus,  in  the  three-proposition  case  only  seven  unknowns  have  to  be  calculated.  To  transform 
experts’  data  to  the  ABN  structure  it  is  enough  to  select  as  unknowns  the  following  ones:  p(Xj), 

p(x2),  pix^),  p{XiX2),  piXjX^),  P(X2X3)  and 

A3.2.  Conditions  of  Consistency  of  knowledge  Pieces  of  rank  2,  3  and  4  expressed  in 
terms  of  positive  conjunctions 

Conditions  of  Consistency  of  knowledge  piece  of  rank  2 
C(^>p(xJ<l, 

Cj^>:p(x2)<l, 

P(Xi)+P(x2)-P(^iX2)^i> 

Cj'^-  P(X2)-P(XiX2)>0, 
c/^^-  P(Xi)-p(XjX2)>0, 

p(xj  >0,  i  =  1,2 ,  p( XJX2)  >  0 . 

Conditions  of  Consistency  of  knowledge  piece  of  rank  3 
p(x,)  £/, 
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C/^>:p(x,)<l, 

p(x,)+p(x,)-p(x,xj<l, 

C/'^:  p(Xj)+p(x,)-p(XiXj<l, 

C/^^:p(x2)+p(xJ-p(x2X^)<l, 

C/"^:  p(Xj)+p(x,)+p(xJ-p(XjX2)-p(XiXj-p(x,X3)+p(XjX2Xj)<l, 

Cj'^-  P(x2)-P(XjX2)>0, 

p(Xi)-p(XjX2)>0, 

p(x,)-p(x,xj>0, 

P(Xi)-p(x,x,)>0, 

Cj'^-p(x,)-p(x,x,)>0, 

p(x,)-p(x,x,)>0, 

p(X2X3)-p(XjX^X^)>0, 
p(XjX,)-p(XiX2X3)>0, 
p(XiX,)-p(XjX,x,)>0, 
p(xj-p(xjxj- p( J + p( X, X2X3 ) ^0, 
p(x2)-p(xjx,)-p(x2x,)+p(x,x,x,)>0, 

Cj'^:  p(Xj)-p(XjX2)- p( X J + p( x^ x^ X, ) >0, 

p(xj  >0,  i  =  1,2,3,  p(x,Xj)  >  0,  i,j  =  1,2,3,  i>j,  p(XjX,x,)  >  0 . 

Conditions  of  Consistency  of  knowledge  piece  of  rank  4 

C<^>:p(x,)<l, 

P(X2)^1> 

C/^^  .  P(X3)<1, 

Cj^>:p(x,)<l, 

C/'^-  P(Xj)+p(X2)-p(XjX2)<l, 

Cj"^-  p(Xj)+p(xJ-p(x^Xj)<l, 
p(xi)+p(xj-p(xixj  <1, 

P(x2)+P(x3)-p(x2X,)<1, 

p(x2)+p(xj-p(x,xj<l, 

p(xj+p(xj-p(x,xj<l, 

P(Xi)+p(X2)+p(Xj)-p(XjX2)-p(XjX,)-p(X2X,)  +  p(XjX,X,)  <1, 

p(x,)+p(x2)+p(xj-p(xix,)-p(xjxj-p(x2xj+p(x,x2xj<l, 
p(Xi)+p(x,)+p(xJ-p(XjX,)-p(XjX,)-p(x,xJ+p(XjX,xJ  <1, 

:  P(X3  )+p(x,  )+p(xJ-p( x^x,  )-p( X3X,  )  -p( x,x^  )+p(x,x,xJ<  1, 


A3. 3 


Appendix  A3.  Algebraic  Bayes’  network  related  equations 


■  P(^l)^ P(X2)+P(Xs)+P(xJ-p(x,X,) -p(XjX2)-p( XjXj-p( x^xj- 
- P(X2XJ- pfx^x, )+p( x^x^x,)  +  p( XjX^xJ  +  p( x^x^x,  )  +  p( x^x^x, )  - 
-p(XjX2X2Xj<l, 

p(x,)-p(x^X2)>0, 

p(Xj)-p(XjX2)>0, 

p(x,)-p(xtx,)>0, 

P(x,)-p(XjX,)>0, 

C2/‘'^-  P(xJ-p(XjXj^O, 

C2/'^-.  p(xi)-p(x^xj>0, 

C2/'^-  P(X3)-P(X2X3)^0, 

p(x2)-p(x2xj>0, 

C24'^-  P(xj-p(x,x,)>0, 

C2/'^-.  P(X2)-P(X2XJ>0, 

C2J'^  -  p(xj-p(x2xj>0, 

C27'^-  P(X3)-p(X3Xj>0, 

C2J'^-  P(X2X3)-p(XjX2X,)>0, 

€2/^^:  p(x,xJ-p(XjX,Xj)>0, 

p(XjX,)-p(XjX,x,)>0, 

Cs/^^  ■■  P( X2X4)-P( XjX2Xj>0, 

p(XjXj-p(XjX2xJ>0, 

p(XiX2)-p(XiX2xJ>0, 

P(X3Xj-p(XjXjXj>0, 

p(XiXj-p(XjX,xJ>0, 

p(XiX,)-p(XjX,xJ>0, 

:  p(xjxj-p(x2x,xj>0, 

P(x2xJ-p(x2X3xJ>0, 

p(X2X2)-p(X2X2Xj>0, 

C4J'  ■■  P(x3)-P(XiX2)- p(  X2XJ+ p(  Xj  X2X2 )  >0, 

Cj*>:  p(x2)-p(XjX2)-p(x2X2)+p(XjX2X,)>0, 

C4/‘'^  ■  P(Xi)-p( XjX2)-p( XjXj  +  p( XjX2X2)>0, 

(^43'^  ■■  P(X4)-p(XjXj- p( x^x^ )  +  p( XjX2X, ) >0, 

:  p( X2)-p( XjX2)-p( x^xj+pf XjX2xJ>0, 

^4!'^  ••  P(  Xi)-p(  XtX2)-p(  XjXj+p(  XjX^xJ  >0, 

Cj"-.  p(xj-p(xjxj-p(xjxj+p(x,x3xj>0, 

Cj*^ ;  p(x3)-p(XjX3)-p(x2xJ+p(XjX3xJ>0, 
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p(Xi)-p(XjX,)-p(XjXj+p( XjX,xJ>0, 

p(xj-p(x2xj-p(xjxj+p(x2x,xj>0, 

CsJ'^  ■  P(^3  )-P(  ^2^3  )-P(  ^3^4  )+P(  X2X,Xj>0, 

:  p(x^)-p(x:pc,)-p(x2x,)^p(x^x,x^)>0, 
p(Xi )>0,  i  =  1,2, 3, 4 ,  p( XfXj )>0,  i,j  =  1,2, 3, 4,  i>j, 
p(x,x^x^ )>0,  i,j, k  =  1,2, 3,4,  i>j>k,p( x^x^x.xj >0 , 


c 

'^S2 

P(^I^2 

X3)-p(XiX,x^ 

X4)^0, 

C  (4) 
'^53 

P(^lX2 

X4)  -p(XiX2X, 

,X4)>0, 

f  (*) 
'-'54 

p(XjX, 

X4)-P(XiX2X3 

X4)^0, 

r  (4) 

'-'55 

P(X2X, 

^X4)-p(XiX,x, 

X4)>0, 

r  (‘*) 

'-'56 

:  p(XjX, 

,)-p(xpc^X3)- 

-piXiX^x^ 

J-i-pfXjX^X^ 

<X4)^0, 

r  <*> 

'-'57 

:  p(x^x, 

,)-p(x,xpc,)- 

-p(x  3X^X4 

)+p(XjX,X, 

X4)^0, 

r  (^) 

'-'58 

p(x,x^ 

)-p(xpC2Xj- 

-p(XjX3X4 

)+p(XjX,Xj 

X4)>0, 

r 

'-'59 

p(x,x^ 

,)-p(xpc^x,)- 

-p(x,XjX4 

,)  +  p(XjX2X^ 

^X4)>0, 

f  (4) 
'-'60 

P(X2X, 

,)-p(XjX2Xj- 

-P(X2XjX4 

J  +  pfXjX^X, 

^X4)>0, 

r  w. 
'-'61 

p(XjX, 

)-p(x  1X3X4)- 

-P(X2X3X4 

J  +  pfXiX^X^ 

X4)>0, 

p(xj-p(xjx,)-p(x^x,)-p(xjxj+p(xjx^x,)+p(x^x,xj+ 

+  p(x,X3Xj-p(XjX2X3Xj>0, 
p(x2)-p(xjx,)-p(x,xj-p(x,xj+p(xjx,x,)+p(x^x2xj+ 

+p(  x,x^xj-p(  XjX2X,xJ>0, 

Cj'^:  p(xj-p(xjxj-p(x,x,)-p(x,xj+p(xjx,xj+p(xjx,xj+ 

-\-p(x^XsXj-p(XjX2X^xJ>0, 

p(xj-p(xjxj-p(x2xj-p(xjx,)+p(xjx2xj+p(x,xsxj+ 

+p(  X^xfcj-p(  xpc^xpcj>0, 

p(xpcpc^xj>0. 
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